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PREFACE 


Obj ectives. The task I objective was to develop and implement, in a 
new geographic area, timber inventory procedures using data sources including 
Landsat, conventional aerial photography, and direct measurements. The 
task II objective was to develop and begin testing a planning model for 
evaluating saj.Tpling and remote sensing measurement factors as they interact 
in sampling-estimation systems. 

Scope of work. The Sam Houston National Forest (145,332 acres) in 
Texas was surveyed to estimate parameters, as nearly as practicable, required 
by the U.S. Forest Service Forest Survey. The planning model used a 
multivariate approach to sample survey design, a population representation 
based on Landsat and supporting data, and concentrated on sampling systems 
which could utilize remote sensing data sources. 


Conclusions . Because of vegetation and terrain homogeneity, Landsat 
data sources were not useful in identifying vegetation class in the detail 
required by the Forest Service. However, for less detailed classifications 
Landsat may still be useful. The survey system utilized Forest Service 
vegetation stratification data and large scale sample photography in a 
system which estimated many of the parameters required by Forest Survey. 
Growing stock volume was estimated as 384 million cubic feet with a relative 
error estimate of 7.8 per cent. Shortcomings of the inventory system as 
well as estimates provided in addition to those required by Forest Survey ahb 
discussed. 




Tlie planning model developed here represents a formalization of the 
interrelationships which must be considered in planning sample surveys for 
geographic multivariate applications. Verification tests and an application 
of the model to the Quincy Ranger District in California have been completed. 
In management applications where a cell-by-cell population model, such as the 
Landsat -based model developed in this study, is available, the planning 
model permits evaluation of the effects of a number of factors as they 
interact in a particular sampling context. These include 1) size and shape 
of sample units, 2) selection probability, 3) precision and probability 
levels, 4) sampling strategy, and 5) cost variables. Finally, application of 
the model could easily be extended to other disciplines using geographic 
populations such as agriculture, range, forestry, and regional planning. 


Summary of recommendations , With additional training and specifications 
of variables required for measurement, the kind of sampling system used 
in this study should be able to provide estimates of all parameters required 
by Forest Survey. A joint application by both Forest Service and research 
personnel would provide the necessary details to bring this approach into 
operational use. Additional work on the planning model is needed to 1) 
increase computational efficiency and the size of populations and parameter 
sets which can be evaluated by the model, 2) incorporate additional capabilities, 
especially more sampling and measurement alternatives, and 3) further test 
and evaluate the model with changing assumptions and input parameters. 
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1 . 0 INTRODUCTION 


The Remote Sensing Research Program (RSRP^ at the University of 
California, Berkeley, has been involved in remote sensing-aided inventory 
systems research since 1973, first through the ERTS investigation 
program and most recently the Forestry Applications Project. This research 
has been directed towards solving the problem of meeting informational 
needs of the resource managers utilizing remote sensing data sources 
including satellite data, conventional aerial photography, and direct 
measurement on the ground in such combinations as needed to best achieve 
these goals. It is recognized that sampling plays an important role in 
generating relevant information for managing large geographic populations. 
The central problem, therefore, is to define the kind and amount of 
samplinv and the place of remote sensing data sources in that sampling 
sysfe.. to do the best possible job of meeting the manager’s informational 
needs , 


Perhaps the most significant recent event which shows the increasing 
importance of implementing comprehensive procedures for describing and 
monitoring wildland systems, is the passage of the Forest and fengeland 
Renewable Resources Planning Act (RPA;) of 1974 and related legislation such 
as the Multiple Use-Sustained Yield Act of 1960 and the National 
Environmental Policy Act. This legislation is directed primarily towards 
management activities of the U.S. Forest Service, but the inplications 
for other federal and state agencies as well as industrial and 
nonindustrial private lands are present. In particular, the RPA specifies 
procedures to be followed in developing comprehensive policies and 
programs with respect to wildland management and requires frequent 
resource assessments to evaluate progress towards these ends. The 
necessity to consider all the varied uses and products associated with these 
lands has shown the need for extensive broadly-based information on a 
variety of wildland characteristics including forests, range, fish and 
wildlife, recreation and wilderness, land and water, and human and 
community development. It is not at all clear what type of information 
production procedures will evolve to meet these needs, hut the strong 
implication is that sampling systems will play a significant role and 
that multiple characteristics will be of interest (Hyde, 1976., p. 283). 

With this premise, a systematic procedure for evalxiating sampling 
alternatives, where the objective is to estimate several parameters, is of 
obvious importance. 

1.1 Contract Background 

Work tinder the current contract included two separate but related 
tasks. Task I involved timber inventory of the Sam Houston National Forest 
in Texas, and Task II was the development of a multivariate survey design 
planning model. Chapter Two reports on the survey of the Sam Houston NF. 
Chapter Three describes the development of the planning model including 
verification tests and application to the Quincy Ranger District of the 
Plumas NF in California. Chapter One provides background on the informational 


needs of wildland resource managers and Chapter Four summarizes and makes 
recommendations for future research. 

Task I also involved developing procedures for a "level II” 
inventory of small selected parcels within the Sam Houston HF. This work 
as well as a part of Task II concerned with developing a plan to test 
the planning model in the same area was not possible since Landsat MSS 
data were not capable of providing the kind of population data required 
by the planning model. Instead^ planning, for a "level II'- inventory is 
considered in a general context in Chapter Three as a possible application 
of the planning model in areas where such data is available. 

1.2 Wildland Management and Information Systems 

A wildland resource system may be partitioned into two interrelated 
subsystems; the forest ecosystem and the management system, as shown in 
Figure 1. 

The ecosystem is the physical entity to be managed. It is defined 
by geographic limits and implies inclusion of the atmosphere above and 
the interior of the earth below. An example would be the ecosystem 
specified by the boundaries of a typical Forest Service Ranger District. 

Two aspects of the ecosystem are of major concern to the forest manager; 
site — components including atmosphere, 'climate, soil, geology, and 
topography; and community — the living component of the system, plants 
and animals. In addition, the ecosystem itself must be considered as 
part of a larger system, which has a certain influence and creates the 
environment within which the ecosystem exists. Macroclimate, fauna, 
and other natural processes from this larger system affect the ecosystem, 
and conversely the ecosystem may affect the larger system. 

The management system consists of man’s activities and efforts to 
manipulate and control the ecosystem. Historically, he has manipulated 
the ecosystem to acquire the things he needed or wanted with little 
regard for the long term consequences of his actions. However,, as the 
scale of demands on the ecosystem has increased so has understanding 
of ecosystem complexities and interactions. The effects of past 
activities have shown that often the short-term achievement of needs 
has adversely affected long term availability of other necessary resources. 
Consequently both our evaluations of the ecosystem as it exists, and our 
view of what the system should look like must be considered as we evaluate 
treatments which might be applied to the ecosystem. "Treatment” is an 
all-inclusive term meaning "an activity carried out by man to alter 
the system". It could be a complex maintenance and harvesting plan extending 
over 5 to 10 years, or it could be the opposite extreme, no activity. 

The desirability, or necessity, of applying treatments depends on Cl) the 
apparen t state of the system and (2) the de&ired state of the system. 

The latter is a transformation of policy, philosophy, and political objectives 
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Figure 1. MANAGEMENT SYSTEM and the object of ■. j 
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into some specific desired or optimal state of the system. A comparisdon 
of the apparent with the desired state allows evaluation of the need for 
treatments to remove or redace existing discrepancies. Additional 
decision-mahing techniques may then be used to determine the 
operational specifications of treatments. The apparent state is based 
on an interpretation of the actual state; however, this 
interpretation is imperfect and subject to errors of various types, 
including possible bias. Hie term apparent state recognizes this 
potentially important distinction between what we think is there and 
what is actually there. Further, while the ecosystem includes all the 
varied components of the physical and biological complex, the desired 
and apparent states typically include only a limited number of particular 
aspects or components of the overall system. 

Both the ecosystem and the management system may be viewed as 
composed of a number of subsystems , In Figure 1, the management system 
is shown as - interacting with the ecosystem primarily through the 
information system and direct applications of treatments- It is the 
information system which provides policy-makers and decision-makers 
with estimates of current conditions so that informed management decisions 
may be made and appropriate treatments applied to the ecosystem. 

1.2.1 The Information System: A Major Management Subsystem 

The information system is linked with the ecosystem through 
data gathering, as well as the decision and action system that utilizes 
the information provided. The kind of information required by the 
decision system determines the nature of the information system and the 
kinds of activities v?hich are necessary to produce the desired information 
products. If well-established decision vnodels exist, the specific informa- 
tion requirements will be known. On the other hand, in the early stages of 
management, when little is knovm about the ecosystem and treatment 
alternatives are not knoim or specified, then the requirements may he 
vague. In this case the description phase may amount to obtaining 
information believed to be important in understanding the system prior 
to determining treatment possibilities. 

Major activities and components of an information system, Figure 2 , 
relate to data acquisition, handling and storage, information production, 
and maintenance. The common factor linking all the components is the data 
base itself. In addition to all of these, an internal control or 
management function is also essential for larger systems . 

Data Aequisifion is the first step in a series of activities which 
ultimately provides information products. As with the other activities it 
is important that acquisition be focused on obtaining data which is 
appropriate for the decisions that must be made by the manager. Many typ^s 
of acquisition are possible ranging from almost continuous monitoring to 
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periodic sample observations or complete enumeration. However, the quality, 
amount, and relevance o£ the data base must be managed within economic bounds; 
hence the importance of developing procedures for the use of sampling 
techniques . 

Data handling and storage has received considerable attention in 
literature on geographic information systems CTomlinson, 1972), Many 
options exist for data manipulation and storage; the important point 
however, is that the particular data structure selected must be flexible 
and that a good geographic reference system be incorporated as an integral 
part of the structure. This is particularly significant where special 
purpose summaries require combining data from several sources. Such a 
requirement would arise, for example, in answering the question, where are 
lands with 1) slope less than 35%, 2) soil type A, and 3) less than 
1320 feet from the nearest road? 

Maintenance and update of data files are particularly important 
so that when new information requirements arise maximum advantage is 
taken of currently available data. Sample survey design can be much more 
efficient if useable data files are maintained. All too often, however, 
the basic data is lost or confused as time passes, requiring costly 
independent sampling, A real commitment to information system maintenance 
is required to provide data continuity for long term management use. 

Information production includes analysis, reduction, and display 
of output products in a convenient format for use by decision-makers. 

Usually both graphic and tabular output is required and some indication 
of accuracy or precision of in.foxmation is desirable. A stratified 
sample could provide all these if the strata are mapped as an overlay to 
a management base map, and the sample estimates are tabulated as a 
complement to the strata map. Prognosis for future developments could 
be handled either as an information system function or as part of 
decision system activities. 

A data base is the common thread which ties the whole system 
together, and must relate on the one side to management needs and on the 
other to basic ecosystem characteristics. Specific types of information 
required vary for different management areas, but all basically relate to 
particular ecosystem components, and to the time, place, and manner in 
which treatment activities will be applied. 

Based on its definition, three primary informational components, 
or dimensions, of an ecosystem are evident: 1) time, 2) location, and 

3) characteristics, or profiles of information. A time dimension is 
necessary since it influences all of man's activities. Geographic location 
within the system is useful since activities are planned for specific parts 
of the system. Finally , characteristics, or profiles of information, axe 
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1 -equired for use in management planning and decision-making. An information 
profile is simply some characteristic of interest, such as vegetation 
type, elevation, slope class, soil type, or any other variable which is 
location specific. These three dimensions define a model of the system, 
portrayed graphically in Figure 3. Time and profiles may be considered 
on an ordinal scale, e.g. time ti, tg etc. and profile 1,2, etc. with 
appropriate explicit definitions for each. 

Definition of ecosystem profiles may be accomplished using a 
hierarchical structure to whatever level of resolution is desired. Given 
specific management activities (which may also be defined hierarchically) 
it is possible to relate profiles and activities through an "association" 
matrix (Leopold, et al, 1971) which may identify influences or reference 
the nature and magnitude of the influence. Table 1 gives an example of 
an association matrix for a wildland ecosystem. The character in each 
cell of the table determines the relative nature of relationships between 
the profile and activities. It would also be possible to use the cell 
entry as a "pointer" to a more specific quantitative statement of the 
relationships. 

1.2.2 Data Acquisition and Information Production 

Information production is the process of translating basic data elements 
into a useable summary format, and methods are largely dependent on the 
manner in which basic data is acquired. Data acquisition may be accomplished 
in many ways, and opportunities for error are associated with each. 

As will be seen in the following paragraphs, sampling is one technique 
that permits control of errors while at the same time providing a measure 
of precision associated with information products. The population sise and 
ease of direct measurement as well as other considerations, such as 
time and money available for making measurements, influence the decision 
on what acquisition methods might be used. Three possible sources of error 
affect information production: specification, measurement, and sampling 

error. 


Specification errors occur in cases where some incorrect decision 
in the planning phase is made which means that although sampling and 
measurement errors may be small the required information is still not 
provided. This situation can arise if there is insufficient communication 
between people vrho use information and those involved with its production. 
Hurried planning efforts often prevent full analysis of information needs. 
Changes in requirements during the information production phase, and 
occasionally misunderstanding can lead to improper specification of what 
is needed. In addition, it is possible that incomplete specifications 
of the sampling population could lead to a survey of the wrong subjects. 
Use of an inappropriate statistical model would also be a specification 
error since even though estimates might be provided, they would be 
inherently wrong. 


Figure 3. 
Location: 
Time: 

Profiles 



A Three Dimensional Data Base and Model for an ECOSYSTEM 


Geographic Location within the System, e.g, UTM Identifier 
(or code); likely varying with Point, line and area data 

Historic records 
Present State Data 
Future Prognosis 

Specific data profiles of interest to one or more management 
areas as determined by Management profiles association matrix 
(Including precision rating) 
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Table 1. Management - Information Profiles Association Matrix 
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Measurement error occurs to some degree because of instrxanentation 
limits or human errors in recording data. The first can be measured, but 
errors in using the instrument are very difficult to evaluate. Errors can 
be made in taking readings, recording data, transferring data, and 
Summarization. Also some Items might be omitted completely. A final 
unpredictable factor is the physical and emotional state of the 
Individuals making measurements. 

Sampling error is introduced in those cases where only a part of the 
population is examined. Inferences about the population based on a 
single sample may or niay not provide a good representation of the true 
characteristics. However, sample ng error can frequently be measured since 
it is possible, at least theoretically, to take repeated samples from a 
population and evaluate variability in inferences. 

All error components can be controlled to some degree. However, 
increased accuracy and precision often results in higher costs, hence 
there is an economic interface with the level of error desired in a 
measurement system. Specification errors can be eliminated by careful 
consideration of the infoomtation needs and appropriate translation -into 
data requirements and statistical models. Instrument error can be 
kept in line with the magnitude of other error components by selection of 
appropriate measurement equipment. For example, several possibilities for 
tree volume measurement are available including diameter tape and volume 
tables, relascope, dendrometer, and cut-and-measure . Human errors can 
be controlled to some degree by selection of who is to make measurements, 
the training given them, and the environment in which they work. Double 
sampling and field checking of measurements can help to estimate and 
reduce these errors, but they are the most difficult to evaluate. 

Sampling error is usually measured by the variance of an estimator 
and is a function of sample size. Intuitively, the larger the sample the 
more precise the estimates are. At the limit, the entire population is 
measured and sampling error is zero. Measurement error is present 
even with sample data, but since these data sets are usually much smaller 
than the population more care can be taken to control measurement error. 
Judicious allocation of the number and kind of sample observations helps 
insure that the required information is obtained for the least cost. 

Considering these error sources and opportunities for their control, 
it is obvious that in many cases sampling is an excellent tool for 
information production. It is usually less expensive than complete enumeration 
and permits one to reduce the measurement error substantially. By 
considering tradeoffs between costs and precision of the information 
derived, sampling error can be controlled to the degree required 
(or to the limit of budget resources) . Population inferences are then 


"better’” than would be obtained with complete enumeration since jneasux’ement 
errors are substantially reduced and sampling error is estimated. 

Sampling represents an important first step in a sequence of 
activities which ultimately may have a significant impact on the ecosystem. 
The effort that goes into designing a sampling system should therefore 
reflect the relative significance of decisions which are to be made. With 
the importance of sampling and its place in management clearly established, 
the central problem of this study is now considered: how to design the 

best possible sampling system, within the limits of objectives and 
constraints imposed by management and the ecosystem. 


2.0 SAM HOUSTON NATIONAL FOREST INVENTORY 


The Sara Houston National Forest CSHNFD is located in the southern 
pine region and was selected by NASA as a second area for potential 
applifiition of remote sensing data in a forest inventory. Initially, the 
orientation was to be toward meeting the needs of the national forest 
managers for timber management planning data. Midway through the 
project but before the survey design had been finalized the emphasis 
was shifted from local requirements to national data needs as outlined 
by the Forest Survey .Handbook [PSH 4809.11). This change in emphasis 
was made in response to the requirements for frequent evaluations of the 
forest resource as required by the Resources Planning Act, of 1974. At the 
same time there are two related aspects which were addressed in this 
study as well. First was the need for broad based resource surveys 
rather than timber or range surveys conducted separately, and second was 
the possibility that local manager’s needs may still be met, partially, 
if not completely, in the same process that acquired data required hy 
the Forest Survey. 

2 , 1 Estimation Objectives 


Forest Survey estimation requirements are currently under revision, 
but the most recent available version of the Forest Survey Handbook was 
used as a guide for implementation in this project (PSH 4809.11 dated 
February 1972, Amendment No. 6). Estimates are required for the entire 
forest and include 1) areas, 2) number of trees, 3) volume, 4) growth, 
and S) mortality with each estimate broken down into various subclasses 
based on species, size class, tree class, and several others. (For the 
most definitive statements of the requirements reference to the handbook 
is required.) A number of output tables are given with specifications 
for each. These same identifiers are used in the output from this 
project. Evaluations are included for each case identifying what was 
provided in addition to requirements, as well as shortcomings and possibilities 
for their correction, in an operational system. The major additional 
item provided by this survey is an indication of the precision achieved 
for each estimate. 

2.2 Development of the Sampling System 

Given the estimation objectives outlined above a sampling system was 
constructed which would, as nearly as practicable, meet those objectives 
by taking maximum advantage of auxiliary information available. The 
potential auxiliary data that were considered included satellite data, 
aerial photography data, Forest Service type delineations., and precise 
yield data for trees on the SHNF. 


2.2.1 Population and Parameter Specification 

The SHNF including only Federal lands constitutes the population 
of interest. Table 2 summarizes the set of tables which contain estimates 
of parameters as required by the Forest Survey. In addition to the 28 
tables (broken down broadly into area statistics, forest estimates, 
product outputs, and projections), are generalized type maps. For a 
variety of reasons not all the tables are produced by this study and for 
tables produced the specifications given by FSH 4809.11 are not always 
obtained. Where discrepancies occur they are identified and discussed. 

2.2.2 Auxiliary Variables Available 

The aim of this study was to design a survey to take maximum 
advantage of available auxiliary data. Several data sources were available 
including remote sensing data of the type utilized in the Plumas 1974 
inventory (Titus et al, 1975, Colwell, 1974). Landsat MSS data provided 
useful information in the first stage of the Plumas inventory. For 
the SHNF, however, after considerable effort was expended in attempts to 
relate spectral data to vegetation classes (timber type/condition classes) 
it was concluded that the uniformity between vegetation classes as sensed 
by the satellite prevented meaningful discriminant analysis of the data. 

This conclusion was supported by a subjective comparison of terrain, 
vegetation, and climatic conditions on the SHNF as contrasted with the 
Plumas. On the SHNF terrain is very flat, climate is humid and wet, and 
vegetation is very homogeneous. Even differences between hardwood and 
softv;oods were difficult to detect on large scale aerial photography except 
at certain times of the year. Forest Service vegetation data were available, 
including maps and acreages by a detailed vegetation classification. 

Large scale photography of the type used in the Plumas study enabled 
measurement on a selected sample area. Individual tree photo measurements 
were considered impractical because of vegetation density and difficulties 
of seeing the ground for parallax measurements. Species separations other 
than hardwood-softwood were impossible; even on the ground species 
differences for pines can be difficult to detect. Instead, stem counts and 
percent crown closure of conifers, hardwoods, and snags were taken as 
well as an average difference of parallax which was relatively well 
correlated with stand height in a small pilot study. Finally, precise 
on-the-ground measurements of Volume are costly to obtain so typically 
local volume table'; are used. A previous study for the SHNF (Baker, 1975) 
provided dendrometer measurements for a sample of trees. The volume 
relationships shown in Baker *s study were so good that additional dendro- 
meter measurements as part of the SHNF inventory were considered unnecessary. 
These local volume relationships use DBH as the independent variable and 
predict cubic foot volume of the tree. 

2.2.3 Sampling Technique and Estimators 

The sampling system for the Plumas 1974 inventory was basically 
a stratified multistage system in which Landsat data were used to make 


Table 2, Forest Survey Tabular Output Summary 



1 Area by land classes 

2 Area of commercial forest land by ownership 

3 Area of commercial forest land by stand size and ownership 

4 Area of commercial forest land by stand volume and ownership 

5 Area of commercial forest land by stocking classes and tree 

class 


6 Area of commercial forest land by stand condition and ownership 

7 Area of commercial forest land by site and ownership 

8 Area of commercial forest land by forest type and 

ownership 

9 Area of commercial forest land by forest type 

10 Number of growing stock trees on commercial forest land by species 
and diameter class 


11 Net volume on commercial forest land by softwood and hardwood 

12 Softwood net volume of growing stock and sawtimber by ownership 


13 Growing stock volume by species and diameter class 

14 Sawtimber volume by species and diameter class 

15 Sawtimber volume by species and quality class 

16 Growing stock net growth and removals of growing stock by 

species 

17 Growing stock net growth and removals of growing stock by 
ownership 

18 Sawtimber net growth and removals by species 

19 Sawtimber net growth and removals by ownership 

20 Mortality of growing stock and sawtimber by species 

21 Softwood mortality of grov;ing stock and sawtimber by ownership 

22 Mortality of growing stock and sawtiimber by causes 

23-27 Product output tables 

28 Projections 




the stratification as well as define the primary sampling units, (Colwell, 
1974) . For the SHNF the use of Landsat data was not feasible so the 
Forest Service stratification was utilized instead. For this reason as 
well, sampling units consisted of variable size clusters located at 
random within the forest areas and with allocation among strata 
proportional to area. The clusters were defined by location of 0.4 acres 
circular plots on large scale aerial photos acquired Septeraber"20*-21, 1975. 

The number of photo plots in the cluster was determined fay the 
proportion of the line which fell in the strata of interest up to a 
maximum of 10. A subsaraple was taken from the set of photo plots for 
direct measurement of variables by a field crew. Figure 4 shows 
schematically the various activities; a technical summary of the 
system including the estimates is provided in Appendix A. 

2.2,4 Measurement Procedures 

Measurements were made to summarise the stratification scheme, to 
obtain photo variables, and to obtain direct measurement of variables of 
interest on the ground. Stratification data were summarized directly from 
Forest Service records. Photo measurements were made of average parallax 
difference (a proxy for height) , number of trees by various categories 
and percent crown closure by the same categories. Individual tree 
measurements were not considered necessary or practical owing to the 
extreme uniformity of vegetation conditions. Ground measurements included 
tree species class, diameter, ten year radial growth (softwoods only) 
bark thickness (softwoods only), and tree quality class. Site class, 
stand age and average height were recorded as well as necessary 
administrative data. Detailed measurement procedures are summarized. 


2.3 Analysis Procedures 


Estimates and tabular output products were generated using a series 
of FORTRAN programs to summarize the basic data and generate estimates. 

T^vo computers were available for the actual computer processing of the 
inventory data; (1) the large computer complex (GDC 7600) at the 
Lawrence Berkeley Laboratory, which was used for processing the Plumas 
inventory, or (2) a Data General dedicated con^uter using a real time disc 
operating system (RDOS) recently acquired by RSRP for machine processing 
of MSS data. The latter system was chosen for the analysis since this 
project offered an opportunity to evaluate the utility of such a system 
for a medium scale analysis effort involving a large quantity of 
numerical data. The dedicated computer system offered the advantage 
of almost real time turnaround of processing as well as the potential for 
cost savings over the larger computer facility. Computational limits 
of 3-4 significant digits were encountered unless double precision 
computing was used. For this research study single precision computations 
were used.' 


Schematic 


Activity 



1] Stratification 

A partitioning into units of 
relatively homogenous vegetation. 
Based on USFS data. 



2} A random cluster sample within 
each Stratton where cluster sample 
units consist of up to 10 plots 
along a flight line. 

Measurements of Photo Variables 
made for each of the 0.4 acre 
photo plots 



3] A random sample of 0.4 acre 
plots from the set of all photo 
sub plots for direct measurements 
of characteristics on the ground 


Figure 4. SHNF inventory Overview 




2.4 Results 


Using the formrt and notation o£ the Forest Survey Handbook, the 
inventory results are presented in Appendix B. Eight tables are included and are 
numbered, according to the handbook tables references, 10, 11, 12, 13, 14, 

16, 18, and 20. They relate to number of trees (10], volume (11,12,13,14], 
growth and removals (16,18], and mortality (20). Each table also 
includes relative error, (the standard error of the estimate divided by 
the estimate), for each table entry. For comparisons of these estimates as 
they relate to the USFS specifications, see section 2.5.1. Tables 
23-28 relate to production outputs and projections and are not derived 
from the same sources as the standing timber statistics. Tables 1-9, 

15, 17, 19, 21, and 22 are not included for various reasons as outlined 
in section 2.5.1. 

Table 3 shows a summary of estimates for the five basic parameters 
of interest. Precision of the estimates compares favorably with the goals 
of Forest Survey (five percent relative error per billion cubic feet of 
volume, growth, and removals] which are given for very larg:e areas. Goals 
for smaller areas, such as a single national forest would be expected to be^' 
larger. Both growth and volume relative errors are less than 10 percent and 
error for number of trees is about 5 percent. The estimates for both 
removals and mortality are much higher than the other estimates because both 
are sporadically distributed over the area and the sampling scheme is not 
oriented towards obtaining this kind of information. 

Additional estimates would be available on a stratum basis since these- 
estimates are obtained in order to calculate overall estimates. They have 
not been included here since the primary interest is on Forest Survey 
requirements. 

2.5 System Evaluation 


The success of the Sam Houston inventory is indicated in the previous 
section. In addition, however, it is necessary to compare this inventory 
with the Forest Survey requirements as well as evaluate the cost and 
ol-'fectivonoHS of the .survey. Finally various components of the .system 
are evaluated to determine what directions future research efforts might 
take. 

2.5.1 Comparison with Forest Survey 

FSM 4809 provides very detailed specifications for various 
parameters to be estimated. As nearly as practical these specifications 
were the objective for the SHNF inventory. A number of items v/ere not 
provided by this study in as complete a detail as required 
by FSM 4809, Volumes were summarized as gross rather than net because _ 



Table 3. Sam Houston National Forest 1976, 

Summary of Estimates and Comparison with 
Simple Random Sampling 



Stratified Ratio j 

Simple Random 

Parameters o£ Interest 

Estimate 

Mean/Acre 

Estimate 

Total 

Relative 
Error C%) 

Estimate 
Mean/ Acre 

Estimate 

Relative 
Error (%} 

Increase 

in 

Relative 
Error [%) 

Area USFS records) 

- 

145,332 

- 

- 

- 

■ - 

“ 

Number of trees 

160 

23,300,000 

4.93 

165 

23,000,000 

8.74 

47 

Growing stoch volme 
C5" + DBH,Ft3) 

2642 

384,000,000 

7.76 

2810 

408,000,000 

9.71 

24 

Sawtiraber volume, ^ 

Cll" + DBH, Ft^) 

1975 

287,000,000 

10.8 

- 

- 

. - 

- . 

Volume growth (10 year. 
Ft. 3) 

76 

11,100,000 

6.84 

81 

11,800,000 

8.52 

25 

Volume removed (10 year. 

Ft. 3) 

56 

8,070,000 

55.1 i 

55 

7,990,000 

67.8 

14 

Mortality volume 
(10 year, Ft. ^3 

41 

6,020,000 

74.4 

33 

4,800,000 

92.5 

25 


• O 




C; 




assessing defect was difficult for the field crew which was from California 
and had only a limited amount of time for training. Through the use of 
local foresters this difficulty could be removed and on-the-ground 
measurements of net volume obtained. Most tables requiring species 
breakdown use several hardwood classes. Species identification was 
difficult because of the time of year when field work was conducted and 
because of unfarailiarity of the field crew with local species types. 

For these reasons only two species groups were considered, hardwoods and 
conifers. Log grade as well was not evaluated because of the time 
required to train a field crew. Growth, removals, and mortality were 
all evaluated at least partially. The growth estimates do not include 
two components of net growth as defined by forest survey: 1) ingrov^th 

Canal! trees which pass the 5,0 minimum diameter limit) and 2) increment 
in trees which were cut, died or became culls. Also growth, removals, 
and mortality estimates are 10 year rather than annual values. 

A number of Forest Survey tables were not included in the output 
from this study, FS tables 1-3, 8 ,9 require area data readily available 
from the USFS vegetation records. FS tables 4,5, and 6 require areas by 
volume class, stocking class, and stand condition class. These items 
were not incorporated into photo interpretation results but could 
be generated given definitions of the various classes, FS table 7 
requires area by site class and these data are best obtained from another 
source since once obtained they are not likk ly to change. FS tables 17, 

19, 21 and 22 require estimates already available in other tables j th-.^^ - 
tables renresent summaries by ownerships and this study only included 
USFS lands. FS table 15 requires volume by quality class and the 
difficulties of obtaining quality data have been mentioned. Since 
permanent plots were not available and it was considered undesirable to 
use an increment borer on hardwoods, growth was not evaluated for hardwoods. 

Permanent plots which are remeasured over time are generally 
considered desirable for estimating change. The possibility exists that 
large scale photographs of specific sample locations could be used for 
specific location of the sample areas for photo acquisition at a second 
point in time. If the details of precise location could be worked out, 
measurement of photo characteristics might correlate well to change in 
volume etc. It is even possible that some electronic device might be 
utilized for precise location of certain permanent locations. However, 
even without such elaborate measures a short interval between 
measurements, for example five years, would enable relatively easy 
location of sample areas using previously obtained imagery. 

2.5.2 Cost-Effectiveness 

Cost effectiveness comparisons between this survey system and USFS 
procedures in the Southeni Region are difficult due to our unfamiliarity 
with local practices and even Forest Survey procedures in that area. 


Independently of this , however, statenients relating to approximate cost 
in dollars and time are possible. All figures given in this section 
ignore manpower and other costs associated with other tasks of this 
contract. In addition, the effort expended in attempting to extract 
useful information from the Landsat MSS data is excluded. Report 
preparation costs and problems associated with changes in the objectives 
of the study are also excluded. Considering manpower, flying time, 
photographic supplies, computer processing costs, and administrative 
and overhead costs, the survey cost was approximately $25,000 if software 
development is included and roughly $20,000 excluding software develop- 
ment. Manpower time requirements were 8.5 man months for survey 
activities, and 1.5 man months for software development. The ratio of 
ground plot cost to photo plot cost was roughly 5:1 excluding per diem 
and travel to and from the SHNF and including much greater labor costs 
than the Plumas inventory expenses. If all activities could have 
proceeded with the best possible timing the estimates could have been 
provided in about a 6 month time frame, 

2.5.3 Stratification • 

As already pointed out the initial objective was to utilize 
Landsat MSS data in the stratification of the study area. Instead, 
existing USFS vegetation classification results were used. That 
classification showed ten different combinations of vegetation and 
condition. It is interesting to note that fifty-five percent of the area 
occurred in one stratum, namely loblolly pine immature saw timber. The 
second largest stratum included only eleven percent of the area and the 
smallest stratum was only 0,07 percent of the area. Hardwoods occupied 
only five percent of the area. For sampling purposes some stratum 
had to be grouped together. With such a skewed distribution in the 
stratification, consideration should be given to possible alternatives 
for a better stratification. On the other hand this does tend to confirm 
earlier observations on the general uniformity of the area. 

2.5.4 Sampling Techniques and Design Considerations 

Results of the survey in terms of relative precision estimates 
are satisfactory for all except removals and mortality estimates. For 
these parameters some other allocation, stratification or selection 
probabilities might prove more useful. It is also possible that unless 
low precision limits for these parameters are acceptable it may prove 
more reasonable to conduct separate surveys or consider alternative 
sources of information for removals. The use of ratio estimators was 
based on theoretical results and it would be desirable to consider 
post survey analysis of the relationships implied to determine if other 
estimators are more appropriate. See also section 2.5,7. 
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2.5.5 Large Scale ”hoto Measurements 

Aside from tae uniformity of the vegetation complex, two other 
factors contributed to problems in interpreting the large scale, small 
format/photos . First, in the mixed hardwood type, the season of year in 
which the supporting photography is obtained is critical and is 
dependent upon what characteristics are to be estimated. For this study, 
photo acquisition occurred in September when many of the hardwoods still 
had their leaves and were easily confused with the softwoods. It should 
be noted, however, that if the photos had been acquired later in the 
year after the hardwoods had dropped their leaves they could not have been 
separated from the snags in the forest. Second, because a small 
format (SSmra) camera was used to acquire the large scale photos, stereo 
triplets ^^ere required to ensure that complete stereo coverage of the 
sample plot, located in the center of the middle photo, was obtained. 

For a number of the plots two stereo set ups were required, viz., photos 
1 and 2 and photos 2 and 3, to complete the interpretation of the plot 
this was not only time consuming but required the added effort of 
maintaining a double recording procedure. This problem might be solved 
if the photo plots were relocated in the overlap area. 

As shown in Table 3 the relative increase in the estimated error 
when only simple random sampling was used was dependent on which variable 
was estimated. It is possible that some other photo variables or 
combinations of variables might lead to even greater efficiencies for the 
ratio approach. Previous cutting (harvesting or thinning). , was not 
evaluated on the photos, but both types of activities could be 
identified and subjectively at least, it may be possible to estimate 
relative amounts of timber removed. 

2.5.6 Ground Measurements 

The only significant problem encountered with ground measurements 
was that of physically locating the sample areas on the ground. 

Experienced photo interpreters were able to overcome this problem 
reasonably well. Variables which were required by Forest Survey but not 
measured were excluded largely due to limited time for training the 
field crews. 
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2.5.7 Photo-Ground Relationships 

Empirical relationships between five photo variables and five 
ground variables for the 29 paired plots are summarized in Table 4. The 
correlation coefficients show only moderate relationships between photo 
and ground variables except for number of trees which is much stronger. 

However the usefulness of photo variables in the sampling system depends 
on both correlation and the relative variability of the photo variables. 
Coefficients of variation are included in the table to provide the 
additional information needed to determine if ratio estimation would be 
an improvement over simple random sampling. According to Cochran [19633 
ratio estimation will provide greater precision if 

o ^1 CV fx'} 

p > p where p = v CV is the coefficient of variation. 

2 CVCy] 

For the data in Table 4, (l-% con.) and growth yield p °-0 . 7835 and for (DP) and 
growth p°=0.3543. Thus since in the first case p=0.4641 these would be 
no gain and in the second case p= 0.4014 so there would be a gain in using 
ratio estimation. Based on this comparison DP was used as the photo 
variable in a ratio estimate of growth. Similar calculations are possible 
for the other photo variables. 


$1 
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Table 4. Correlations Between Photo and Ground Variables for 29 Plots in the 
SHNF . 


PHOTO 


GROUND 



# Trees 

DP 

1 - % Con. 

if Snags 

1 - % GC 

ft Trees 

Volume 

G tf Trees 

.7510 





1.000 


R Voliime 

-.0703 

.5004 




.3476 

1.000 

0 Growth 

.2244 

.4014 

.4641 



.5220 

.6201 

U Mortality 

1 .0223 

-.0221 

-.1673 

.4266 


- . 0405 

.3343 

N Removals 

.1094 

-.2687 

-.1462 

.1823 

.2320 

.0748 

.3152 


1.000 

-.2598 1.000 

-.2478 -.0892 


1.000 


P if Trees 
H DP 

0 (l“%Con.} 

T Snags ;) 

0 


1.000 

.3592 

.8718 

.4134 

-.5597 


1.000 

-.1554 

-.2303 

-.1882 


1.000 
.1890 
- . 7845 


1.000 

-.1771 



DP = Average difference in parallax 

l-% Con = 1 minus per cent crown closure of conifers 
if Snags = the number of dead trees 

l-% cc =1 minus per cent crown closure expressed in hundredths, (all species) 



Variation 

% 

39.1 
43.4 

38.1 
280.9 
413.5 


50.6 
27.0 

59.7 
34.5 

136.4 








3.0 SURVEY PLANNING MODEL 


Survey planning is by nature not restricted to one particular kind 
of application or population type. The work described in this chapter 
was undertaken in an attempt to better define the procedures required 
to design sampling systems utilizing remotely sensed dataj hence much 
of what was developed is presented in those terras. At the same time 
the sampling background and conceptual development are independent of 
these considerations. The population model used, however, is strongly 
remote sensing oriented in that it implies considerable information about 
the population in a digital format which can be analyzed by conqiuters. 
This kind of information has only recently become available through the 
ERTS and Landsat programs. Early results have shown that these digital 
spectral data can be analyzed and provide useful land use data with 
resolution elements of roughly one acre. This chapter contains 4 major 
sections which 1} present the sampling problem framework and planning 
model conceptual development, 2) describe the components and functions 
of the planning model, 3) present the verification tests and application 
results, and 4] discuss potential applications to small wildland 
populations. 

3.1 Sampling and the Survey Design Problem 

Logical questions when considering a sampling task are: How do we 

design an optimal sampling system and what does ’'optimal” mean? In 
answering these questions, three topics are discussed in this section. 
First, basic approaches to univariate sample design are included for 
historical background. Second, sampling for estimation of several 
parameters is discussed as an increasingly important aspect of sampling 
wildland populations. Third, the conceptual framework for a multivariate 
survey planning model is presented. Orientation of the model is towards 
geographically referenced populations. The population model used in the 
planning context is derived from multispectral data obtained from the 
Landsat satellite and combined with supporting data acquired by direct 
measurement of specific variables. 


5.1.1 Univariate Survey Design 

Survey design is the process of putting together appropriate sampling 
and measurement techniques to estimate a parameter of interest for a 
particular population. According to Raj (1968, p. 31) the fundamental 
principle of sample design is related to the fact that: 

"With every sampling and estimation procedure is associated 
the cost of the survey and the precision (measured, say, in 
terms of the mean square error) of the estimates made. Only 
those procedures are considered from which an objective 
estimate of the precision attained can be made from the sample 
itself. And the procedures should be practical in the sense 


that it is possible to carry them through according to de- 
sired specifications* Out of all these procedures of 
sample selection and estimation (called sample design), the 
one to be preferred is that which gives the highest 
precision for a given cost of the survey or the minimum cost 
for a specified level of precision. This is the guiding 
principle of sample design.” (Raj, 1968, p. 31) 

Thus the objective is a sampling system vjhich will estimate the parameter 
of interest with the greatest precision and the least cost subject to 
whatever constraints may be imposed by the management system. This 
formulation, however, assumes perfect knowledge of exactly what is 
needed and how precisely it is needed, or equivalently how much it is 
worth, in the context of decisions which are to be made using the 
information. This* assumption, however, is rarely, if ever, met and 
considerable additional research is needed on this important question. 

In addition, it must be pointed out that a minimize cost objective 
ignores many benefits which might be associated with one system compared 
to another over and above the cost effectiveness similarities. 

The objective can be translated into specific terms for sampling 
so that a ’’cost effective” system would provide the desired sample 
estimates for the least possible cost, subject to time and other 
constraints , For design purposes , the obvious performance measure is 
the cost of the system which meets all the specified constraints. The 
decision rule, depending on whether cost or precision is considered 
fixed, is to select the system which gives minimum cost subject to a 
specified precision level, or select the system which gives minimum 
variance subject to fixed cost. In a broader decision-making sense 
however, it must be noted that considerations in addition to system 
cost v;ill necessarily be incorporated into the final decision on a 
particular alternative. These factors would include technical 
constraints such as the time of year when measurements can or must be 
made, availability of necessary measurement equipment, training and/or 
experience levels required for various techniques, and capital costs 
associated with setup for particular sampling or measurement techniques; 
social factors relating to motivation and willingness to alter existing 
procedures; and political factors relating to organizational structure, 
function, and responsibility. Additionally, the decision may be based 
on an array of solutions with varying assumptions about the nature of 
the population and measurement relationships. 

Without any knowledge of population conditions survey design plays 
a minor role and sampling is largely restricted to random sampling. 

As observed by Mahalanobis (1952, p, 40): 
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"It is only when some previous information Cwhich may be only 
approximate in nature) is available about the field that the 
problem of the sample-design becomes important. The object 
then is to use the available infoamiation in the most effective 
way to prepare an improved sample-design "the sense that 
it would be an improvement over the best design that would 
otherwise be possible) so that it can he reasonably expected 
to reduce the cost of the survey as much as possible without 
sacrificing the accuracy, or alternatively, reducing the 
margin of error (or uncertainty) to the greatest possible extent 
for the same expected cost." 

Remotely sensed data, as from aircraft and spacecraft, provide auxiliary 
information which is inexpensive relative to direct measurement costs 
and is often related to the variables of interest. This kind of data 
offers the possibility of improved survey design as well as the 
opportunity to minimize the need for costly direct measurements, 
particularly when dealing with large geographic populations. 

Identification of alternative sampling strategies depends on the 
nature of the population, the parameter of interest, cost or precision 
constraints, and other institutional factors. The system will be 
greatly influenced by the definition of 1) the sampling frame, 2) the 
sampling units, and 3) supplementary information and its availability. 

Within the same problem context, variations are possible in the way the 
sampling frame and units are formed; this influences the amount and 
type of auxiliary information which determines the appropriateness of 
different sampling techniques. Previous sampling experience and an 
understanding of the nature of the population and problem context aid the 
designer in limiting the set of feasible systems to a small number of 
likely candidates. Since selecting some subset o£ alternatives offers 
the possibility of excluding the best one, considerable care must be 
used in selecting the set of alternatives for detailed evaluations. 

Given a set of alternative sampling systems, it is still necessary 
to determine the optimum allocation of effort for each. 

The decision rule stated earlier requires minimizing cost (or variance) 
subject to a constraint on precision (or cost). For any given sampling 
alteraative it is possible to evaluate both the cost and precision in terms 
of mathematical relationships, which are functions of sample size. Then, 
using the techniques of LaGrangian multipliers the combination of sample 
sizes which will yield the minimum cost, while still achieving the 
desired precision levels, may be determined. After repeating this 
procedure for each of the alternative designs, the one with lowest cost 
may be identified. 

ii 
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With the population specified and estimation objectives defined, the 
design problem for any system is solved by fixing: 

13 the sampling frame and stratification 

2) the sample units and measurements required 

33 the selection probability scheme and procedure 

43 the estimator functions 

53 the variance estimator functions 

63 the sample size and allocation. 

Success of the optimization procedure is dependent on how closely population 
conditions are reflected by approximations used in allocating resources. 

Since estimates of population conditions are subject to considerable uncer- 
tainty, it may be desirable to adjust them to reflect optimistic or 
pessimistic attitudes towards their value. Several methods can be used 
to modify these estimates to account for uncertainty (Duerr et al., 19753. 

Considerable research has been done on theoretical comparisons between 
sampling techniques and methods (Cochran, 1963; Raj, 1968; Sampford, 1962; 
Hansen, Hurwitz, and Madow, 1953), Most discussions of survey design are, 
however, devoted to listing and describing the vhrious techniques (Raj , 1974) . 
Ek (1968) and Schreuder, et al., (1963, 1971) have made comparisons among 
a number of sampling techniques using forest populations, but design 
and allocation of effort were not included, in their work. 

Aldred (1971) is one of very few who has attempted to put a number of 
techniques together with a particular population representation in a 
planning model. His work was limited to univariate methods and did not 
consider any multistage sampling techniques, but it provides an excellent 
example of a systematic approach to the problem of univariate survey design. 
The performance criteria, P, vjas defined as the variance of the estimator 
U,V(i![k3, vfhere k represented the identifier of a particular alternative 
sampling procedure. The objective for each design was to minimize P subject 
to a fixed budget constraint; a linear budget function was assumed. For 
each design the least-cost allocation of samples was used in a simulation 
of sampling from a population model to evaluate both sampling methods and 
their interactions with various aspects of the measurement process. 

3.1.2 Design for Estimation of Several Parameters 

Even though most surveys are designed fox only one parameter of 
Interest, data are usually collected on other variables as well, since from 
the manager's vieifpoint there is increasing need for information on a 
number of characteristics. It is logical to acquire this data as much as 
possible within the context of one sample survey since a large segment of 
wildland survey cost goes to preparations and travel time. The incremental 
cost associated with additional measurements on selected sample units is 
quite small by comparison. On this basis it is logical to ask: how can we 


design a survey to estimate several parameters, considering cost and/or 
precision desired for all the parameters estimated? To solve this problem 
requires use of the basic decision-making procedures used in the one 
variable design problem but with new objectives and performance criteria. 

When discussing statistical procedures dealing with several variables, 
certain definitions of terminology are desirable to avoid confusion. 
'Multivariate”, taken in Its general sense, simply means several, or more 
than two, variables; however, it also refers to a body of statistical 
analysis procedures dealing with the multiple variables as a set CKendall 
and Stuart, 1968, p, 239). The key feature of multivariate data is that a 
vector of observations is associated with each sample element in the data 
set. Multivariate analysis makes inferences based on the set of variables 
as a whole rather than separately. On the other hand there are cases 
where several variables are considered but the data sets are definitely 
not multivariate in nature, and these may logically be referred to as 
"multiparameter" data sets. For exanqsle, if data on gross yield for a 
set of N trees are available, but net yield is only recorded for a 
small subsample of the N trees, then the data is not multivariate since 
there is no vector of observations. The data sets would more 
reasonably be referred to as multiparameter. The terminology problem is 
further complicated by the analysis procedures which may be used to generate 
estimates and make inferences based on the data sets. Multivariate data 
sets could be analysed using either the usual tmi variate estimation methods, 
with each variable being treated independent of the others, or they could 
be analysed as a set using standard multivariate procedures [Morrison, 1967) . 
The survey planning model developed here has been termed a multivariate 
model since the sample data set is in fact a vector of observations for 
each sample unit. It must be qualified, however, since analysis procedures for 
evaluation of alternative designs are based on mivariate estimation methods 
for each of the variables. Multivariate analysis could be incorporated if 
simultaneous inferences (Miller, 1966) were needed for management 
decisions. However, at this point the decision problems have not reached 
a level of complexity where simultaneous inferences are useful. In this 
study, the term multivariate will be used in its general sense and will 
not refer to the very specific multivariate statistical analysis context. 

3. 1.2.1 Objectives and the Decision Criteria 

Since sampling for several variables is a natural extension of the 
univariate case, optimization objectives and the de-cision criteria reflect 
most of the limitations outlined in the previous section on univartiate 
' methods. These included 1) the possibility that the set of alternatives 
considered in the design process might not contain the true least-cost 
alternative, 2) the assumption that the precision desired reflects the 
utility of information derived, 3) the use of a decision criteria which 
does not evaluate benefits, as well as cost, of one system as compared to 
the other, and 4) the need to include exogenous variables in deciding which 
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alternative is "best”. 


One objective 1 which evolves naturally from univariate procedures 
would be to minimise cost subject to variance constraints for some or all 
of the parameters to be estimated. Others have been suggested including 
1) minimize cost plus loss (Cochran, 1963, p. 120} where loss is a 
weighted function of variances for each parameter; 2} a variation on the 
approach of Wens el (1974} which is to minimize a weighted function of 
relative variances subject to a constraint on cost; and 3) minimize cost 
subject to constraints on the generalised variance, the detei-minant 
of the covariance matrix (Chakravarti , 1954; Ghosh, 1958, p. 162; Arvanitis 
and Afonja, 1971}. Only the first approach places precision constraints 
individually on one or all the characteristics to be estimated. Each 
of the others implies use of rather novel performance criteria or 
constraints. If, for example, appropriate weights could be specified, 
the composite variance function of Wensel’s approach could be used. 

However, definition of loss functions, weighting factors and the meaning 
of the generalized variance require additional evaluation and testing 
to determine how they might best be used, if at all, in the context 
of current decision-making and management activities. 

Minimizing cost subject to constraints on variances, or providing 
estimates of each parameter within desired precision limits, is useful in 
the current management context since parameters are generally not all of 
equal importance. Additionally, managers can usually specify precision 
levels which reflect the relative importance of the different parameters. 
This objective and the implied performance criteria, system cost, will be 
assumed in the later development of the survey planning model. The 
decision zule will be to select the alternative design with the lowest cost. 

5. 1.2, 2 Allocation of Effort 

After specifying the precision desired for each variable and the 
minimum-cost objective, several approaches can be taken to determine the 
best allocation of effort. Univariate methods could be used to make a 
sample allocation determination for each parameter and then by some 
process derive a sampling procedure which would provide for all required 
estimates. Examples could include 1} separate independent samples for 
each parameter, 2} several independent samples to survey groups of 
related characteristics , 3} a single sample but with measurements taken 
only on the minimum required number of sample units for each parameter so 
that certain units would not be measured for all variables, and 4} a sample 
selecting the maximum number of units required for any variable and still 
measuring all characteristics on all units. These univariate procedures 
offer some control on precision, but do not necessarily minimize the cost. 
Further, opportunities for simultaneous inferences are limited and strict 
multivariate analysis procedures are not possible except for the fourth 
alternative which generates a multivariate data set. 
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Assianing that all characteristics are measured on each sample unit, 
a multivariate allocation based on a mathematical minimization procedure, 
referred to as programming Chillier and Lieberman, 1967), and discussed 
extensively in later sections, may be en^iloyed. Using that approach, the 
objective is to minimise the cost of the survey, expressed as a 
mathematical function of sample sizes, subject to constraints (usually 
variance) for each parameter of interest. Programming is the mathematical 
procedure of finding that allocation of sample sizes which yields the 
lowest cost. Huddleston et al. (1970) describe application of this 
technique to stratified sampling and compare it to two other allocation 
procedures. Their example showed that with an allocation based on the 
fourth univariate method listed above, over-sampling occurred and 
estimates were more precise than required. Using another allocation, 
based on the average of sample sizes for each variable by strata, sampling 
failed to meet all precision constraints. In Huddleston's stratified 
example, allocation by the programming technique yielded the lowest 
cost while still meeting all constraints. 


Comparing the programming allocation approach with the third 
imivariate method is difficult due to problems of identifying differential 
sampling rates and incremental measurement costs associated with each 
variable for the univariate allocation method. For a stratified example 
comparison is possible by considering the general form of cost functions 
for both methods of allocations. Consider the two functions for total 
variable cost (TVG) for the programming (PRO) and separate (SEP) 
allocation alternatives : 


L M 

TVCCPR05 

L M 

TVCCSBP) 


where L = the number of strata 

M = the number of parameters 

= the sample size by programming for the h^ strata 

nj^^ = the sample size by optimum allocation for the 
l^tb strata and variable i 

"Umax" 

Cj^^ = access cost for units in strata h 

Chb ~ implementing differential sampling rates 

in each strata 


c, . = incremental measurement cost for variable i in 

strata h. 


The second term in the cost function will he greater for the programming 
approach since all variables are measured for each sample unit. But, the 
first term will be greater for the separate approach since 1) the * 
maximum number of sample units for each variable must be visited in each 
strata and 2} there is the additional cost due to differential • sampling 
rates. The actual values of the cost coefficient determines which 
alternative is less costly; however, if access costs are high compared 
to measurement costs and the terra increases as the number of 
variables, it is likely that the programming approach will be the most 
cost-effective way to allocate effort. 

3,1.3 Formulation of the Multivariate Survey Planning Model 

Given the important role of sampling for resource management and the 
increasing emphasis on multivariate sampling applications, it is evident 
that a formal planning mechanism is needed to document and improve the 
selection of techniques and procedures to be used in meeting estimation 
objectives. One approach to providing such a mechanism for multivariate 
survey design is the planning model developed in this study. 

A basic function of the model is to systematically evaluate 
alternative sampling schemes using available population and cost data. 
Results of this evaluation include estimates of cost and implementation 
specifications for each of the alternatives. One significant benefit of 
the planning model approach is that assmptions, inputs, and evaluation 
techniques are all formalized and subject to review, modification, or 
improvement . 

T\vo assumptions are crucial to the model development. First, advance 
data is needed for each of the variables of interest. In addition, the 
data must be geographically referenced in detail so that it can be 
combined and summarized as required by sampling alternatives. This kind 
of population data, even if it is somewhat crude, allows 
flexibility in evaluating, among other things, size and shape of sampling 
units and methods of stratifying the population. Second, the goal is to 
minimize cost subject to precision constraints for each of the parameters 
to be estimated. Generally a survey budget is fixed in advance by 
administrative considerations, but desired precision and expected cost 
must be consistent with overall budgeting limitations. A minimize-cost 
objective is useful even if a budget is fixed since it will show the 
ejfpected survey cost. If this cost is greatly different from the fixed 
budget, either above or below, the budget or precision limits need to be 
modified, otherwise money would be spent acquiring extra data not really 
needed, or the full budget would be spent and the result would be far from 
meeting precision objectives. 

Figure S shows the basic components of the planning model. Advance 
population data and problem specifications are the primary inputs required, 
and a central control mechanism is needed to coordinate the other model 
components. The complexity of the population model depends on the nature 







of available population data. In an extreme case the data could be 
complete so that only an input device would be required; on the other hand 
it is more likely that several data sources would be combined to form a 
population model, possibly with simulation of missing or incomplete data 
elements. A preprocessing activity summarizes population data in a 
specific form required for a particular sampling alternative. Then, 
given the summary data on the population and other inputs, the allocation 
of effort component identifies, for a particular alternative, the 
sample sizes v;hich minimize cost while meeting precision constraints. 

Aside from model control which coordinates and governs the scope of the 
model, allocation of effort is the most complex component, with 
proceeding components preparing data for its use and subsequent components 
using its results. Finally, in preparation for making summary outputs 
an evaluation of the set of alternatives is made to rank them by cost. 

Because of the large amount of data required and the complexities 
of the solution algorithm, implementation of -^-he model requires access 
to a large scale computer, 

3. 1.3.1 The Population Representation 

A population representation, such as described in Chapter One provides 
a very important starting point for the planning model. It must preserve 
the spatial distribution characteristics, a key feature of geographic 
populations, which may have a significant impact on the sampling 
methods used. In addition, if the data base has been well designed and 
maintained, available and pertinent data for the population will be 
readily accessible. With this data as a starting point, simulation 
techniques may be used to fill any data gaps. 

Most geographic data base structures use either the polygon or grid 
approach. The first uses a series of polygons, with coordinates of 
vertices in some convenient system, to code information on locations of 
the data elements. The major advantage is that less data storage space 
is required, but this is offset by the requirement for more complicated 
analysis methods. If the area is partitioned into mutually exclusive 
cells, data elements may be assigned to each unit in the population. 

Size, number and shape of the cells determines how much detail is reflected 
in the data base, A 40 acre cell size would give a very different 
representation than a 4 acre size. A major disadvantage of the grid system 
is that more storage is required since each element in the population 
must be identified and labeled; however, this is offset by simpler analysis 
methods . 

Simulation is often useful since numerical data relating to the 
parameters to be estimated are based on relatively small sets of sample 
data for various classes of cells within the area. The cell assignment 
may be known for each element in the population, but estimates of 
specific parameters are usually based on limited sample data. A cell -by-cell 


class assignmentj or stratification, is an excellent tool for 
evaluating spatial distribution as it influences sampling methods. 

In this study such a digital data base is available as the result 
of a discriminant analysis of muitispectral data, acquired by 
satellite. Each cell, roughly one acre in size, is assigned to one 
class in a land use and vegetation classification scheme. 

Complementing this partitioning into land uses and vegetation classes 
are sample data providing estimates of specific parameters of 
interest for each of the classes. 

With appropriate assumptions about the probability distribution 
of the variables over the population it is possible to simulate 
individual cell values with average characteristics maintained and 
spatial distribution influences preserved. This technique seems 
very useful where population sampling units are likely to be clusters 
of cells rather than individual cells since the variability due to 
simulation is reduced by clustering the cells. The most obrvious 
distribution for simulation is the multivariate normal which is often 
not too realistic, but is easily simulated. Other distributions 
derived from the multivariate normal including multivariate chi-squared 
and t could be simulated to account for some types of departures 
from normal. With a cell-by-cell population representation, simunaries 
are easily made to obtain variance components required for the 
sampling alternatives to be evaluated. 

3. 1-3.2 Allocation of Effort 

Ansv/ering the questions. How to sample?, and what cost?, are the 
goals of the allocation of effort component. It requires as input 
summary data and produces as output, for a given alternative sampling 
system, the sample sizes and allocation as well as implementation cost. 

To achieve this goal requires, more specifically, 1) mathematical 
representations of variance relationships for a specific sampling 
scheme and cost of sampling activities, 2) estimates for cost coefficients, 
3^ estimates of population variability, and 4) a method for finding 
the least cost allocation of sampling resources. Mathematical models for 
cost and variance relationships can be obtained for particular 
sampling schemes. Cost of various sampling activities can be obtained 
from previous experience. Nonlinear programming, a mathematical 
procedure for determining an optimal allocation of resources subject to 
certain constraints, provides a means for allocating sampling resources. 
The general nonlinear problem has not been completely solved; however, 
jiost sampling problems fall into a special class of problems for which 
solutions are possible. Success or failure of the planning process 
depends largely on how closely the inputs to the planning model correspond 
to the actual relationships in the population to be sampled. 

In summary, the planning model utilizes a population representation 
which is derived from the data base of the information system and it 
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incorporates as much o£ the basic data as is possible including 
simulation of certain detailed values on a unit-by-unit basis. This 
representation then forms the basis for generating specific inputs for 
an allocation model, a convex nonlinear programming problem formulated 
for a specific sampling system. The resulting allocation of effort 
minimizes the associated cost function. With these basic elements, the scope 
of the model is limited only by the alternative systems which, can be 
formulated for allocation and by the details of cost or other comparisons 
which are possible among alternatives. 


3.2 Tlie Multivariate Survey Planning Model for Geographically- 
Referenced Populations 


From a technical viewpoint the planning model utilizes a series 
of input data sets and is composed of several computer algorithms 
which create the detailed population representation and enable 
evaluation of various sampling alternatives. This section outlines 
the components of the planning model including data inputs, the 
population representation, and the resources allocation technique 
as well as the process of developing an optimum sampling system. 

Figure 6 summarizes the sequence of activities which leads to 
development of optimum sampling system. These activities relate to 
1) generating a population representation, 2) identifying alternative 
sampling systems, 3) allocating resources for each, and 4] 
selecting the best alternative, A brief summary of the computer 
software package based on the planning model is included as 
Appendix C. 

3.2.1 Planning Model Data Requirements 

A number of planning model inputs are required to define the 
problem context and saimpling alternatives to be evaluated. The 
success of any design effort depends on the amount and quality of 
advance information about the population. At the outset the exact 
limits of the population must be defined so that the sampling frame 
may be identified or constructed. For geographic populations this 
means delineating boundaries, especially any areas to be excluded 
from consideration because of physical or other considerations. 

Examples would include such things as water bodies, ownership boundaries, 
urban areas, or special areas reserved from management for political 
or ecological reasons. 

The specific parameters to be estimated and relationships among 
them may have a significant impact on the allocation of sampling 
effort. If the variables are highly correlated the sample allocations 
may not be substantially different from a univariate allocation. On 
the other hand, lack of any strong relationship will mean that one of 
the parameters will require more sampling than the others, and the 
cost of acquiring this larger sample will be the lower limit on 
Implementation cost. In any case, however, the parameters to be 
estimated must be defined explicitly so that measurement procedures 
and population estimates used in the planning phase are consistent. 
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Figure 6. Generalized sequence o£ planning model 
activities . 


specify population and parameter set 

i 

generate population elements 

I 

identify feasible alternative sampling 
systems 

summarize population representation for 
input to allocation process 

i 

allocate resources for each alternative 

I 

compare system costs and select the best 
system from the set of alternatives 

i 

summarize specifications for the best 
system 


Along with this definition, the desired precision associated with 
each parameter must be determined since these form the major 
constraints which the system must be designed to meet. Of course,^ 
the actual implementation of a sampling system based upon this design 
may or may not achieve the desired precision. This could be caused by 
errors in assumptions about the population characteristics, or by the 
possibility that an "unrepresentative" sample is obtained. Specifica- 
tion of desired precision must take this possibility into account 
since the level of precision specified affects the likelihood of a 
"wild" sample. 

. With the sampling population limits defined, and parameters 
specified, the next requirement is an estimate of the population 
variability for each of the parameters. Since this input data is 
dependent on the nature of the population representation it is discussed 
in the next section. Finally, sampling alternatives and the associated 
activity costs must be provided so that overall system costs can be 
evaluated. Absolute cost values are not necessary as long as relative 
relationships are maintained between alternatives . 

3.2.2 The Population Representation 

Several possibilities exist for population representations which 
would be useful in survey design. Aldred (1971] for example, used a 
large sample of data obtained from medium scale aerial photography in his 
univariate planning model. With an interest in stratification, clustering, 
and a set of parameters, however, the approaches that can be used are 
more limited. This study utilizes inputs from several sources and a random 
variable simulator to obtain a spatially referenced population represen- 
tation which can then be stratified and clustered in many ways. 

The input sources for estimates of population conditions include 
interpretations of digital multispectral data obtained by satellite as 
well as complementary sampling data for specific parts of the population. 

The simulation of individual population elements assumes a 
multivariate normal distribution which is very convenient for simulation, but 
may not be indicative of actual distributions which are often nonnormal 
and skewed. Simulation of multivariate nonnormal distributions might 
be desirable in some cases but is considered a refinement which could 
be incorporated later if data on distributions of the variables becomes 
readily available. 

The Landsat data base provides a cell-uy-cell model of the population 
v^here each ceil is roughly one acre in size. Estimates of the actual 
parameters by strata are provided from other sources which could include a 
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special pilot survey or previous surveys in the same or a similar 
population. 

Perhaps the single most important planning model assumption, and the 
one least subject to verification is that the population representation 
is a reasonable reflection of the true population relationships. Sampling 
based on a design developed using this planning model would provide 
one possibility for evaluating sampling system population model, but 
even this is subject to sampling variability which may still not give a 
true representation of actual population conditions. Thus, even though 
the sampling system found to be optimum relative to the planning 
model is used, this system may not be optimum with respect to the 
actual population. 

3.2.2. 1 Population Simulation 

The population simulation is based on 1) a spatially referenced 
gridded data base, 2) a vegetation or land use class assignment for each 
data ceil, and 3) parameter estimates for each of these vegetation and 
land use classes, A set of parameter values is simulated for each cell in 
the population. 

The Landsat [NASA, 1971) multispectral data structure provides a 
convenient representation of large geographic areas (100x100 nautical miles) 
for manipulation using computer systems. The data base may be viewed as 
a Cartesian coordinate system with each pair of coordinates representing 
the location of a single data cell. These cells are commonly called "pixels", 
or "picture elements". The Landsat system acquires four items of 
spectral reflectance data for each pixel and provides the starting point for 
generating the population representation. This data base is very 
convenient for computer manipulation and may be related to any nimiber of 
other "profiles", or levels of information such as elevation, aspect, coordinate 
systems, ownership, or other institutional and organizational data which 
are referenced to the Landsat coordinate system. 

In order to relate the population limits to the Landsat data base, 
the MAPIT data handling system (RSRP, 1976) has been utilized. Among 
other things, MAPIT permits the intersection of two or more profiles of 
information so that a particular study area can be defined and extracted 
from the larger full scene base. This is accomplished by translating a 
representation of the study area in one coordinate system to that of the 
Landsat data base. A least squares procedure is commonly used to fit 
a mathematical function using a set of control points with coordinates 
observed in both systems and well distributed over the study area. These 
same control points can also be used to determine the empirical average 
cell (pi.’cel) size for a particular study area since there are inherent 
variations in scale for each different Landsat scene. 
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With the study area physical limits defined, and the basic 
spectral data available from Landsat, the problem is to identify the 
land use or vegetation class for each pixel from some appropriately 
defined classification scheme. Dealing with wildland populations this 
scheme would include such classes as water, agricultural, urban, and 
various classes of vegetation related as nearly as possible to major 
vegetation types and conditions. Several techniques for accomplishing 
this classification are available and they all involve the statistical 
techniques of discrimination and classification (Kendall and Stuart, 

1968, p. 314). Discriminant analysis would assign to each pixel a class 
from a predefined set of classes. Classification would group the 
elements into clumps or classes which would be as distinct from each 
other as possible. These classes however, would be obtained 

independent ly of any predefined scheme. Additional work would then be required 
to relate classification results, sometimes referred to as "computer 
classes," with the predetermined class structure. 

Given the classification (meaning either discrimination or classification) 
results, and the estimates of the population average parameter set (by strata) 
individual parameter sets are simulated for each pixel in the population. 
Simulation is based on the assumption that the parameter set for 
each strata has a multivariate normal distribution with parameters, 
mean vector and covariance matrix, determined by class assignment. 

Loosely speaking, the simulation is of a stratified multivariate normal 
distribution with the classes representing strata and each pixel preassigned 
to a class. This simulation, following a technique outlined by Naylor 
et al, (1968), preserves the relationships among the variables which 
are represented by the covariance matrix. Additionally, certain classes 
may not be of interest for sampling, e.g. water, urban, agricultural areas, 
etc., and these may be effectively deleted by giving a zero vector for 
the parameter set. Estimates of the mean vector and covariance matrix for 
each of the land use or vegetation classes are usually based on some kind 
of sampling procedure either previously on the same area or another 
similar area. Since these advance estimates are based on sample data 
which may not even be from the same population area, it may be desirable 
to modify them to account for either optimistic or pessimistic attitudes 
about their quality or validity. No provisions exist in the planning model 
for changing those variability estimates, rather they must be modified 
prior to input. 

3. 2. 2. 2 Preparation for Sampling and Allocation of Effort 

From the population simulation, an abbreviated population 
representation is derived which includes several elements of the sampling 
system and also helps to minimizes the variability due to simulation 
while at the same time allowing incorporation of spatial relationships. 

This abbreviated representation provides the population variance components 
necessary for allocation of sampling resources. Rectangular or square 
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clusters are formed from the basic internal data base structure and these 
become primary sampling units fPSU’s] and the sequence of PSU's 
becomes the sampling frame. 

A stratification of PSU's is introduced at this point based on 
counts of pixels in the various classes of interest, excluding water 
and others not of interest. The strata for any particular PSU then 
becomes the class which has a plurality of pixels. This straightforward 
interpretation of PSU conditions was utilized in the model since it closely 
resembles the point -by-point classification results. If the overall 
classification scheme meets management needs, then it is inferred that the 
stratification of PSU's by this method would also meet management needs. 

Selection probabilities are derived for computing the population 
variance components. Either "equal" or "proportional to size" probabilities 
for selection of PSU's may bo used. The "size" variable is the number 
of pixels in "good" classes, classes of interest. Equal selection 
probabilities are assumed for subsamples within PSU's. Population 
variance components are then derived using these probabilities for both 
between and within PSU's. 


3.2.3 Allocation of Effort 


For each alternative sampling system the optimtun allocation of effort 
is that which minimizes cost subject to a number of constraints. The 
usual LaGrangian minimization techniques of univariate survey design 
become more and more awkward to deal with as the number of constraints 
increases, and for this reason other methods need to be considered. 

3.2.3. 1 The Allocation Problem Formulation 

To overcome computational difficulties associated with multiple 
constraints, mathematical programming techniques have been developed. The 
general programming allocation problem is formulated as follows 
CHillier and Lieberman, 1967 j : 

minimize f (x^,X 2 , .... ,x^^) 

subject to 


g]^CXi,X2,.. 
g^CXj,X2, . . 






where the (xj[^ | i=l. . .nj are the decision variables, allocation values, and £, 
the objective function, and[g.jj j = the constraints, are given 
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functions of the decision variables. The well known linear programming 
formulation is a special case where the functions are all linear. For 
the general formulation the Kuhn-Tucker conditions (Hillier and Lieberman, 
1967 p. 575; Fiacco and McCormick, 1968, p. 20} establish necessary 
but not sufficient conditions for location of a local minimum. If 
convexity assumptions are met (Hillier and Lieberman, 1968) , that is f 
is convex and the gj are concave functions with all functions continuous 
and differentiable,”^ then meeting the Kuhn-Tucker conditions constitutes 
sufficient conditions for a local minimum, and a local minimum for the 
convex problem is a global minimum (Fiacco and McCormick, 1968, p. 90). 

Kokan and Khan (1967) have shown that stratified random sampling 
variances and a typical linear cost function form a convex set. In 
addition, the techniques of two stag - and double sampling have been 
included as other examples of the application of convex programming to 
sampling problems (Kokan, 1963). Intuitively it follows that sampling 
problems will generally meet these assumptions of convexity; however, 
any violation of these assumptions is important and checks for 
convexity have been incorporated in the planning model. 

Wildland sampling applications of nonlinear programming have been 
largely limited to replacement strategies for continuous forest inventory 
(Hazard, 1974) and the work of Arvanitis (1971) for stratified sampling 
using a generalized multivariate variance function. 


3. 2. 3. 2 Example of An Allocation Problem 


Stratified saji^jlied is a logical example for demonstrating the 
nonlinear programming problem formulation for a sampling problem. Several 
different approaches have been used in the literature (Ghosh, 1958; 

Kokan, 1963; Kokan and Khan, 1967; Chatterjee, 1968; Huddleston, Claypool, 
and Hocking, 1970; Arvanitis, 1971). This formulation is included here 
as a basic example and because the ntimerical data given by Huddleston et al. 
is used as a test problem for one component of the planning model. The 
objective function to be minimized is 


L 

* '2: * "K2 vfV ) 

h=l 


where c 


hi 


= measurement cost for a unit in stratum h 


c, o = travel cost between units in stratum h 
h2 


L = the number of strata 


The primary constraints are a set of M variance expressions, one for each 
parameter to be estimated: 
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N, 


In terras of the general formulation this becomes 
V. - VCY.)>0, j - 1,2, 

Mnimum and maximum sample size constraints .would also be imposed to make 
the solution set bounded. 


3. 2, 3.3 The Allocation Algorithm 

Many computer algorithms have been developed for the nonlinear 
optimization problem (Hillier and Lieberman, 1967) but none are completely 
satisfactory (Fia-cco and McCormick, 196S) . The allocation algorithm 
used for the planning model, SUMT, is based on the theoretical work 
of Fiacco and McCormick (1968) and was developed by Research Analysis 
Corporation (My lander, et al, 1971). It is a computer software package 
which utilizes the ftinctions as well as their first and second partial 
derivatives to, identify the (K^li =l...n) which minimizes a composite function 
(P) made up of the ob|ectiye function and. the constraints. Hillier and 
Lieberman (1967) provide a relatively intuitive description of the way 
the algorithm works. Basically the constraints form a penalty function 
which is forced to go to zero and then the minimum of P is the minimum 
of F subject to all constraints. 


To implement SUMT the user must specify a number of parameters 
which control the operation of the program, but more important three 
FORTRAN subroutines must be provided to evaluate the various functions 
and their first and second partial derivatives. Since specifying these 
functions is often a source of error in implementation, numerical analysis 
methods are available at the user's option to approximate these 
partials either as a check on the user supplied subroutines or to be used 
In place of them. 

3.2.4 Development of an Optimum Sampling System 

Development of an optimum system is comprised of three interrelated 
activities: 1) identification of alternative feasible systems; 2) 

optimization within alternatives; and 3) selection of an optimum sampling 
system. The planning model provides a convenient fool for examining and 
optimizing a number of alternative systems. Qualifications on the 
term "optimization", outlined in chapter two, apply to the planning model 
since a limited number of alternative sampling strategies are available. 
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3. 2,4.1 Identification of Alternative Feasible Systems 

Alternative sampling systems are identified as particular 
combinations of cluster size and shape, stratification method, selection 
probabilities, and sampling technique. The first three factors influence 
the summary population representation and the last factor relates 
to the particular optimization formulation. Rectangular or square clusters 
up to a maximum of 90 x 90 pixels are pefmitted. As described in the 
earlier section on preparation for sampling, only one stratification 
5 'heme is available in the current version of the model; selection 
probabilities can be either equal or proportional to si;;e. The model 
allows three sampling strategies; stratified, stratified two stige, 
and stratified two stage with double sampling. 

Stratified sampling assumes direct measurement of all units within 
a cluster arl s therefore impractical except for very small cluster 
sizes. StTii^j + ied two stage sampling eliminates the need to measure the 
entire saraplxng unit by introducing a subsample to estimate sampling 
unit characteristics. Stratified two stage with double sampling assumes 
that in addition to subsampling within PSU's, the set of secondary 
sampling units (SSU's) over all PSU's is measured to evaluate an auxiliary 
variable (x) which is related to the ultimate variable of interest (y) . 

If the relationship is reasonably good and these x variables are relatively 
inexpensive to evaluate, at least when compared to the cost of evaluating 
y, then a large number of SSU's may be sampled and a smaller subset of all 
the SSU's sampled for direct measurement of y, following the classical 
double sampling procedures (Cochran, 1963). 

Utilizing a regression model based on the paired (x,y) observations, 
an estimated characteristic y is obtained for the larger set of unpaired 
observations. Remote sensing data can often provide the auxiliary data 
in a very cost effective manner. This is tru3 especially if the 
relationships between x and y are reasonably good and the cost of photo 
measurement is small compared to direct ground measurement. A linear 
relationship is assumed in the planning model. This is the method used in 
the Plumas National Forest inventory of 1974 (Titus et al., 1975; Colwell, 
1974) where the first stage units were rectangular clusters of 225 pixels 
(45 X 5 pixels) , second stage units were 0.4 acre circular plots located 
within PSU's, and the third stage units were a subset of the set of all 
second stage units. The strength of the relationship between the two 
variables is reflected in the correlation between the two, an input to the 
model. Currently, the same correlation is assumed for all parameters and 
strata but this could be expanded to include differences between 
parameters and/or strata. 
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Associated with each o£ the sampling methods is a cost function which 
determines the cost of implementation given the various sample sizes. Al] 
cost functions include terms for cost of direct measurement and also travel 
cost between units using a square-root-of-sample-size term based on the 
rationale presented by Hansen, Hurwitz, and Madow [1953, p. 272]. 

These cost functions include only variable costs associated vrith each 
alternative since fixed costs do not affect the allocation procedure. 
Appendix D shows the explicit cost functions associated with each 
alternative technique. 

3. 2. 4. 2 Optimization of Feasible Alternatives 

Optimization of an alternative system requires appropriate summary 
population data for the particular sampling framework chosen and a 
nonlinear problem formulation for the basic sampling strategy. Of the 
three sampling strategies included in the planning model, stratified 
sampling has been outlined earlier. In addition, expressions for the 
first and second partials of the objective and each constraint function 
are required. Similar formulations are required for stratified two 
stage and stratified two stage with double sampling. Second partials for 
the latter alternative were very cumbersome to evaluate algebraically so 
numerical differencing methods were utilized instead. A summary of 
the programming formulations, including first and second partials for 
all three strategies is given in Appendix D. Given the appropriate 
programming formulation and population summary data, the SUMT algorithm 
is executed to obtain the optimum allocation of effort and its 
associated cost. 

3. 2.4. 3 Selection of an Optimum Sampling System 

The planning model can evaluate one or more alternative sampling 
systems and 'will rank the alternatives by cost values. Sampling 
allocations are summarized and constraint values and other solution 
parameters are provided which may be useful in making the final decision 
among alternatives. As has been pointed out earlier cost comparisons alone 
will not provide the final word in selecting a best alternative. Fixed 
cost requirements for different systems may be significant and would have 
to be considered. Since each population representation and the other 
model inputs may be quite different in different applications it will 
generally be necessary to consider several different precision levels 
and cost coefficients until sensitivities of the model to various inputs 
are determined. Even then it may be necessary to document the relation- 
ship between cost and precision constraints and evaluate the factors 
which are limiting the least-cost solution so that managers will become 
aware of the importance of careful consideration of precision limits. 


Finally, but oftra o£ most importance are the factors other than 
strictly design considerations which are likely to affect the choice of 
a particular system. Practical limitations such as the amount of work 
which can reasonably be accomplished in an 8-hour work day may affect 
cluster sizes or sample size and allocation. Related to this would be 
overall time constraints for survey completion. Other factors have 
been mentioned earlier including expertise and training requirements, 
and organizational structure. 

3. 2.4.4 Specifications for the Optimum Sampling System 

The results of the planning phase need to be summarized in the 
form of a plan for implementation of the optimum sampling system. This 
very important aspect of survey work is often neglected and can 
seriously affect the overall success of the survey. A complete 
implementation plan is a necessary aid for focussing effort and ensuring 
that objectives are met with a minimum of compromise. It should include 
a statement of objectives, desired products, as well as summaries of 
all population data utilized in the planning phase and should address 
the following topics in considerable detail: 

1) estimation objectives 

2) summary of planning phase 

3) sampling system description 

4) implementation procedures and specifications 
analysis procedures. 
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3. 3 Model Verification and Application 

For model construction, testing, and application, an area in 
northern California was utilized because as a result of prior research 
the kind of data required by the planning model was available. Tnis 
section is divided into three major parts. The first describes the 
study area and construction of the population representation. Discussion 
of verification tests and application results comprise the other 
two parts. 

3.3.1 The Study Area and Data Inputs 

Previous research work in the Plumas National Forest lead to the 
selection of the Quincy Ranger District as the study area. As a result 
of a 1974 survey for the entire National Forest, conducted utilizing 
Landsat multispectral data, large-scale photography, and ground measure- 
ments Cfitus et al, 1975), a data base of the type required for the planning model 
was available. A stratified three stage sampling system was used with 
the Landsat data aiding in making the stratification and generating 
selection probabilities for first stage sample units. The second and 
third stages utilized large scale photo and ground plots respectively for 
successively more detailed measurements. Federal lands in the Quincy 
Ranger District area, approximately 196,000 acres comprised the study 
area, as shown in Figure 7. 

This area is a good representative of the northern Sierra Nevada 
mountains of California with an elevational range of 2000-7000; including 
both the western and eastern slopes of the mountain range. 

The gridded data base structure generated by the Landsat (then 
called ERTS) multispectral scanner system (NASA, 1971) was used in the 
population model. All other data sources were transformed to the Landsat 
base. This base is made up of a set of picture elements (pixels) 
which contain a vector of specific observations made by the scanner at 
a particular point in time. The on-the-ground area covered by each 
pixel varies according to parameters associated with the satellite 
orbit and its relationship to the area scanned. For the Plumas NF it 
has been determined empirically that the area represented by each pixel 
is a little more than one acre. Individual cells are referenced by an 
(x,y) coordinate where each point is one pixel. To identify the study 
area explicitly and to delete all internal blocks of private ownership, 
an ownership base map was digitized into a local (x,y) coordinate system. 

This local system was then transformed to the Landsat system using 
mathematical functions based on least squares regression analysis of a 
series of control points located in both coordinate systems. The results 
of this intersection of ownership and multispectral data s .own in 
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Figure 8, a color composite of three spectral bands for the Quincy Ranger 
District, including roughly 172,000 pixels and representing 196,000 acres. 
The blacked out areas represent non-federal lands. 

With the source data and study area combined, a maximum likelihood 
discriminant analysis (RSRP, 1976) was performed to assign each pixel 
in the population to a vegetation or land use class according to a 
structure determined by the training areas. As required by discriminant 
analysis, training areas or examples of particular classes of vegetation 
and land use are required on which to base the rules for assignment of 
individuals to specific classes. For the study area two separate 
analyses were made, because of the Landsat data structure, each with 
50 to 60 examples of various vegetation and land use categories. Further 
3^ggT'eg3^tion of vegetation classes into 17 broad vegetation classes was 
made by interpreting each of the training areas in light of the desired 
stratification scheme which included three classes of vegetation (mixed 
conifer, east side pine, and true fir) and within each vegetation class, 
five classes of vegetation condition, (regeneration, immature, mature, 
overmature, and poorly stocked) were recognized. Two other classes 
were specified, hardwoods and a residual of all other classes. Results 
of this class aggregation are shown in Figure 9. 

Associated with each of the classes are a number of variables 
important for management. Depending on the particular ecosystem 
component to be m.anaged the set could be limited to a few major variables. 
I^or example, the 1974 inventory in the study area- concentrated on 
variables important for timber management, and therefore, many variables 
useful in managing such components as recreation, watershed, wildlife, 
or range were not included. Still, a basic approach to a number of 
resource values depends on the vegetation components and their condition 
and distribution. For this reason it is considered appropriate to use 
as parameters, in the verification and application of the planning model, 
three characteristics which describe the tree component of vegetation. 

The three are number of trees, basal area, and basal area growth. The 
last two are highly correlated with the volume and volume grov/th. At 
the same time they are basic physical measures of the vegetation component 
which could be related to other management areas. Number of trees 
combined with these gives an indication of the relative size of the trees. 
Selection of these parameters is not to imply in any way that the planning 
model is limited to timber related parameters. If data is available any 
parameter set could be used. The sample data from the 1974 inventory 
provided estimates of the mean vector and covariance matrix for each 
vegetation class (Appendix F ) . 

In addition to the cell^by-cell vegetation class data and the 
parameter estimates by class, other data inputs are also required. 

However, since this application is primarily experimental, no desired 
precision levels were specified initially; instead they were considered 
as variables to be included in the analysis. Cost data for the 1974 
inventory (Appendix E ) were used to generate coefficients for the cost 



Figure 8. Landsat MSS data color composite of three bands for the Quincy 
Ranger District. 
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Figure 9 . Discriminant analysij results for the Quincy Ranger District. 
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functions used with the different sampling strategies. As with the 
precision levels, cost was treated as a variable, and a number of values 
were used. Derivation of these coefficients as well as precision 
factors are discussed in the section on application results. 

3,3.2 Verification Tests 

With a study area defined and population data inputs available, it 
is desirable to verify the proper functioning of the various planning 
model components. In particular, for the population model component, 
verification of the simulation procedure and its influence on allocation 
of effort is necessary. For the allocation of effort component it is 
important to verify the functioning of the experimental SUMT algorithm 
and evaluate assumptions made with respect to the form of estimators 
used in alternative sampling strategies. 

In generating the population model for the Quincy Ranger District, 
over 172,000 individual parameter sets were simulated. To verify the 
accuracy of the simulation, averages of simulated parameter values in each 
vegetation class were compared with the input parameter values (summarized 
in Appendix F) . Table 5 summarizes the average values of the simulated 
parameters by class number; class number from Table 5 is equivalent to 
the TCC code listed in Appendix F. Differences between the average sim- 
ulated value and the input parameter values are on the order of less than 
one per cent . 

To see how much the particular sequence of random numbers used in the 
population simulation affected the overall results, two separate sequences 
were used to generate two population models. These models were then 
used in a series of allocations for various alternative sampling systems. 
The results v;ere evaluated in terms of differences in overall system cost. 
For all alternatives the differences in objective, or cost, function 
values was less than one per cent. Differences in the related allocation 
of effort were also very small. (Table H-1 and H-4, Appendix H.) 

The experimental SUMT algorithm, a major component of the planning 
model, was te.sted using a numerical agricultural example cited in 
lUiddloston et al. (1970j for a stratified random sampling application. 

In the example there are 15 strata and 7 characteristics to be estimated. 
The objective is to minimize cost subject to variance constraints for each 
of the parameters. This same problem has been solved using a different 
algorithm, FCDPAK, developed by M. J. Best at the University of Waterloo, 
Canada. FCDPAK has options for using se\/eral approaches to solving the 
allocation problem. The feasible conjugate direction technique (Hillier 
and Lieberman, 1967) was used for the agricultural test problem. The 
results of this test are summarized in Table 6, giving the published, 
FCDPAK, and SUMT allocations. This shows that FCDP.AK and SUMT give 


Number of Points in Each Result Class, and Means for Result Class Parameters 


Class No. 

Name 

Total Pts, 


Means 


0 

Null 

0 




1 

One 

0 

0.000 

0.000 

0.000 

2 

Two 

1044 

81.610 

55 . 745 

4.. 958 

3 

Three 

10896 

158.865 

117.376 

12.723 

4 


0 

0.000 

0.000 

0.000 

5 

Five 

4257 

36.472 

27.874 

2,515 

6 

Six 

9865 

84.573 

81.591 

5.837 

7 

Seven 

51001 

108.563 

75.895 

7.022 

8 

Eight 

19852 

132.544 

65.4SE 

7.932 

9 


0 

0.000 

G 000 

0.000 

10 

Ten 

12104 

40.962 

22.591 

2.425 

11 


0 

0.000 

0.000 

0.000 

12 


0 

0.000 

0.000 

0.000 

13 


0 

0.000 

0.000 

0.000 

14 


0 

0.000 

0.000 

0.000 

15 


0 

0.000 

0.000 

0.000 

16 

Sixteen 

30919 

28.786 

17.864 

1.798 

17 

Seventeen 

32512 

0.000 

0.000 

0.000 


Number of points in illegal old classes = 2 
Number of points in illegal new classes = 0 

Total number of points processed from NOVAMAP = 420000 

Total number of points in result classes = 172450 

Table 5 . Population Simulation results summary 


Table 6 . Allocation Algorithm Test Problem Results 
h PUBLISHED FCDPAK SUMT 


1 

232 

232 

232 

2 

103 

96 

96 

3 

87 

75 

75 

4 

117 

104 

104 

5 

150 

157 

157 

6 

207 

200 

200 

7 

106 

104 

. 104 

8 

92 

83 

88 

9 

87 

87 

87 

10 

369 

443 

443 

11 

98 

110 

no 

12 

33 

32 

32 

13 

83 

77 

77 

14 

27 

27 

27 

15 

49 

49 

49 

TOTAL 

1840 

1880 

1880 

COST 

39300 

40900 

40912 
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essentially identical allocation but these are both different from the 
published optimum. Further analysis showed that the variance constraints 
for the third and sixth parameters are violated using the published optimum 
sample sizes. The variance values are 523,692 and 26.447 respectively 
compared with upper limits of 507.49 and 25.98 respectively. These 
results were obtained by using the published optimum as initial points 
for both FCDPAK and SUMT with the result that the initial point was 
infeasible. 

Associated with any sampling system is the physical procedure for 
selecting the sample, making measurements, etc. Estimators are usually based 
on assumptions about the particular way in which sample units are selected. 

If the actual technique varies from the assumed technique there may be 
changes in the performance of estimators derived from the sample. Sampling 
from finite populations is one case where there is opportunity for 
deviations from the usual assumptions, since formulations of estimators 
are frequently based on sampling with replacement or from large populations. 
The formulations for variable probability sampling, one alternative 
selection procedure used in the planning model, are particularly important 
since in actual practice sample unit selections are often made without 
replacement of units. This means actual selection probabilities change for 
each selection. To investigate this effect on small and medium size finite 
populations a sampling simulator was constructed. The results of applying 
the simulator to two different populations, one with 12 units and the 
other with 115 units are summarized in Appendix G. For both populations 
the average value of Y, the estimator of the population total based on 
repeated sampling of IS units for the large population and 4 units for the 
small population, was not significantly different from the true population 
total . 


The average value of variance estimators CVC^)) however, showed 
more valiability. In particular for the small 12-unit population the 
average value of ^ CY) was about 50 per cent larger than the actual variance 
of the estimates ( Appendix G ) . In addition, estimators based on 
sampling v/ithout replacement [including the general PPS estimator but 
with selection without replacement) consistently outperformed [lower variance) 
estimators based on with replacement sampling. 

All this confirms what was expected based upon theoretical considerations. 
It also documents the usefulness of the usual PPS estimator [which assumes 
with replacement sampling) when sample selection is actually without 
replacement, with the caution that the variance estimates will be biased. 

3.3.3 Application Results 

With the population model specified and the planning model available 
it is possible to evaluate the sampling effort required to meet desired 


precision levels for each of a number of alternative sampling systems. It 
is obvious that time and money constraints on the planning phase, as well 
as the limitations of the planning model, will limit the total number of 
alternatives which might be evaluated. Nonetheless, by careful selection 
of few alternatives to be examined, it should be possible to gain 
valuable insights about the best, at least in a suboptimal sense, way to 
allocate sampling resources in those situations where we must act on the 
basis of incomplete and uncertain information. In addition, the ability 
to vary a number of factors which specify the nature and performance 
of a particular sampling system enables the analyst to interact with the 
manager to test alternative sampling system formulations. 

3. 3. 3.1 Factors Evaluated 

Analysis of sampling systems for the Quincy Ranger District is based 
on alternatives obtained as combinations of the following factors: 

1] level of confidence and precision, 2) cost, 3) sampling strategy, 

4) selection probability, 5) cluster size, and 6) correlation between 
measurement variables. Table 7 provides a detailed outline of factors 
evaluated in this case study. However, not all combinations of all factors 
were evaluated. Ti^o additional factors, the population model and relationships 
among the parameter set, obviously would affect conclusions about which 
design might be best. Taking these as given limits the generality of 
any conclusions i^hich might be made. 

Cost and precision might be factors which would be specified in 
advance, however, uncertainty as to what the real sampling costs are or 
what precision level is desired, often makes it necessary to evaluate the 
impact on design varying both cost and precision levels. 

The cost alternatives listed in Table 7 require some explanation of 
their derivation. All cost coefficients are based on the actual costs 
of the Plumas 1974 inventory, and they differ primarily in the amount of 
travel and per diem costs included. For stratified two-stage (STRAT2S] 
sampling the total costs per unit were based on the number of PSU's 
evaluated and for stratified two-stage with double (STR2SD) sampling the 
costs were based on the number of ground plots visited. Cost levels 
one and two represent the average cost per unit for each term independent 
of the form of the cost function. In level two, travel and per diem 
costs are deleted. In level three travel cost and per diem are combined 
with measurement costs. Cost level 4 includes the affect of the form 
of the function used in deriving cost coefficients. For example, in 
STRAT2S the coefficient is derived as 

TC = Cj Vn* 

3149 = Cj'/sO 
575 = Cj 
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Table 7, Factors and descriptions of factor levels utilised in application 
of the Planning model to the Quincy Ranger District. 


Factor 


Level 


Description 


1, Selection 

Probability 


2. Sampling 

Strategy 


3 Cost CSTRAT2S3 


Cost(STR2^) 


4. Correlation 
(STR2SD) 


Equal probability 

Probability proportional to sise 

STRAT2S - stratified two stage sampling 

STR2SD - stratified two stage with 
double sampling 

C^ = 20, - 25, Cg = 200 

C^ = 20, C^ == 23, Cg = 125 

Cj^ = 20, C 2 = 147, Cj = 0 

C^ = 20, C 2 = 23, C 3 = 575 

C^ = 50, C 2 = 7, Cj = 15, C 4 = 11, Cg = 175 

C^ = 50, C 2 = 7, Cg = 15, C^ = 11, Cg = 100 

C^ = 50, C 2 ^ 7, C^ = 15, C 4 = 111, C 5 = 0 

= 50, C 2 = 7, == 15, C^ = 11, C 5 = 778 

S. = 0.8 


I = 0.9 


5 Random 

Number Sequence 


Random number seed 1 
Random number seed 2 


6 . Probability 
Level for 
Confidence 
Statements 


t = 1 . 00 , 1 -a - .68 


t = 1.96, 1-a = .95 


7. Allowable 

Error 

8 , PSU Size 


^ 20 % for each parameter 
^ 10 % for each parameter 
60 X 6 pixels 


40 X 4 pixels 


10 .X 10 pixels 
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IVhere 


TC = 3149 = total cost of n* PSU's in Plumas 74, excluding Travel 
and per diem cost 

XI* = 30 = number of PSU’s sampled in Plumas 74. 

The overall set of numerical results for the Quincy Ranger District 
are summarized and tabulated in appendix H, but will be discussed and 
evaluated in the following paragraphs. 

3. 3. 3. 2 Analysis, Phase I 

The initial analysis effort considered cluster size to be fixed 
(60 X 6 pixels) and generated allocations based on combinations of 
selection probabilities (equal and PPS) , cost (two versions), 
correlations betv/een photo and ground variables (0.8 and 0 .9) , sampling 
method (STRAT2S, STR2SD) , level of confidence and precision (combinations 
of t = 1.00 and 1.96, and allowable standard error = 0.2 and 0.1 for 
each of the three parameters) . 

The results of evaluating these 32 alternatives helped to further 
substantiate the overall functioning of the planning model since a number 
of trends expected based on sampling theory were confirmed. Increasing 
the precision requirements resulted in substantially increased cost 
values regardless of the levels of other factors. However, the magnitude 
of the increase did depend on the other factors. For example, 
increasing precision from the lowest level (t = 1.00, AE = 0.2) to the 
highest (t = 1.96, AE = 0.1) required slightly more than three times 
the cost for PPS selection but more than five times the cost with equal 
selection probabilities. As expected increased correlation between 
variables (0.8 to 0.9) used for regression relationships lead to a 
decrease in sampling costs though by only about 5 per cent. 


The difference between the two cost alternatives was a. lower travel 
cost term in one case and the results as expected reduced overall cost. 
However, changing the cost alternative did not change the relative 
ranlting between alternatives. STR2SD consistently required lower costs 
for all levels of the other factors. 

3. 3. 3. 3 Analysis Phase II 

Based on these results the correlation coefficient was set at 0.8 
and was not varied in subsequent analysis. The middle two precision 
levels and one cost combination were dropped. Two additional cost 
alternatives and two additional clusters sizes (40 x 4, and 10 x 10 pixels) 
were added to the analysis. 
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The second phase of the analysis yielded additional insights into 
the relationships among the alternatives. Sampling with selection 
probabilities proportional to size consistently required a lower cost 
than equal probabilities and the relative differences between the two 
increased with larger PSU size. On the other hand, given equal selection 
probabilities, smaller PSU sizes yielded lower cost values. In changing 
from low to high precision limits the marginal cost increase is 
generally less with the systems utilizing more auxiliary information, 
that is with PPS selection and double sampling. Cost coefficients can 
change the rankings among alternatives, especially for low precision 
limits. Combining travel costs with measurement costs for both sampling 
strategies leads to the conclusion that STR2SD is better in all cases 
examined here. However, treating travel costs as a separate term in the 
cost function leads 'to the conclusion that STRAT2S is better for 
low precision requirements. 


The third cost alternative attributes a large cost to travel and 
seems to give a more realistic cost value as compared with actual costs 
required for the 1974 survey. This alternative confirms other 
interpretations since STRAT2S shows substantially less cost than STR2SD 
for low precision and PPS sampling and slightly less cost for high 
precision. With equal selection probabilities STR2SD in slightly better 
for high precision limits. 

3. 3.3.4 Sensitivity Analysis 

Four quantitative factors, correlation, probability level, 
allowable error, and PSU size, can be evaluated in terms of percentage 
changes in the objective functions as the factor level is changed. 

Tables 8 through 1 1 show results of such a comparision over the different 
levels of the remaining factors. 

Percentage reduction in cost when correlation was increased 
from 0.8 to 0.9, Table 8, was variable depending on other factors, 
particularly selection probability and allowable error. For the PPS 
selection method the reduction was about 6 per cent for a 0.1 allowable 
error and 3 per cent for 0.2 allowable error. For equal selection 
probabilities the reduction was about 3 per cent regardless of allowable 
error. 


PSU size . Table 9, appears to be affected primarily by the selection 
probability method. With PPS selection, there is generally a decrease 
in cost as PSU size increases. On the other hand with equal selection 
probability there is an increase in cost as PSU size increases. The 
percentage changes are quite variable with no obvious patterns with 
respect to the other factors. 


STRATEGY 

PROB 

COST 

P LEVEL 

AE 

F.9/F.8 




1 

.2 

-.03 



1 


.1 

-.06 

S 

PPS 


1.96 

.2 

-.06 

T 




.1 

-.07 

R 



1 

.2 

-.04 

2 


2 


.1 

-.06 

S 



1.96 

.2 

-.06 

D 




.1 

-.06 



1 

.2 




1 


.1 

■■ 




1,96 

mm 

-.04 


EQL 




-.03 




1 

.2 

fmm 



2 


.1 





1.96 

n 

-.04 





Hli 

-.03 


Table 8. Decimal percentage change in cost function, F, 
when correlation changes from 0.8 to 0.9 for 
different levels of other factors. 
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Table 11. Decimal percentagf 
probability level, 
from 68% to 9.‘i% fc 
factors . 


.8 


2 


.52 



.2 

.2 

.2 

.1 


1.10 

.48 

1.08 

,71 

.67 

1.35 

.91 

1.66 

.89 

1.69 

1.17 
1.91 
1.16 
1.94 
2.00 
2.25 

2.17 
2.39 
1.83 
1.89 
2.00 
2.03 
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Since the ratio of level two to level one of both nrobabilitv level 
aid allowable error is roughly two, the percentage change in F is about 
the same for the two factors. Tables 10 and 11. The response appears 
to be independent of correlation, but it is affected by the level of other 
factors including cost, selection probability, and strategy. In 
particular the STRAT2S strategy requires a 200 per cent increase in cost. 
Less than 200 per cent is observed with STR2SD, and even less using 
PPS selection probabilities. 


3. 3.3. 5 Optimal Model Output 

The only conclusion which holds over all levels of factors 
examined here is that PPS sampling requires a lower cost than equal 
probability sampling, and that a large PSU is better with PPS sampling 
while a small PSU size is better with equal probability sampling. These 
conclusions must be qualified by the limits of this particular population 
model and the range of PSU sizes evaluated. However, the implication 
seems to be that regardless of relationships among the variables them- 
selves, the larger PSU sizes strengthen the relationship between the 
size variable used for the selection probability and the magnitude of 
parameter totals by PSU. Conclusions about which alternative system 
is best must be qualified by the particular factors and levels which 
are utilized. 

Taking the set of inputs and model parameters which seem to be as 
realistic as possible within the limits of this analysis one may 
select an optimum system. With this system it is possible to complete 
the planning process by preparing a detailed implementation plan. 

Table 12 presents the summary niodel output and appendix J is an abbreviated 
implementation plan based on the assumptions and specifications of the 
planning model. In particular the assumed specifications for system 
performance are 1) probability level for confidence statements at 95 per cent , 
2) allowable error ofjj^lO per cent for each parameter, 3) correlation 
at 0.3 for the double sampling alternative and 4) cost factor level 4 
as shovm in Table 7 . With these assumptions in addition to those of the 
population representation, the optimum system is identified from the 
possible alternatives (Appendix H , tables H-1 to H-6) . System cost 
is $23,190 (assuming cost coefficients are in dollar units), and 
its basic elements are 1) PSU size of 60 x 6 pixels, 2) selection 
probabilities proportional to size, and 3) stratified two stage with 
double sampling. Because of the small sizes of strata 2, 5 and 6 they 
w juld be combined or included in the larger strata for implementation. 
Additional details on the sampling system as it would be implemented are 
given in Appendix J . 
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Additional discussion of the detailed model solution output is useful 
since with it one can interpret the effects of the various constraints 
on the solution as well as other factors including the stratification 
scheme. A representative summary of both input and model output are 
included in Appendix I. This example has 8 different strata and 
associated with each is a sample size for the number of PSU's (n*)j the 
number of SSU’s (n’^, and the number of direct measurement units (n) . The 
first page, labeled SUBROUTINE READIN, provides inputs of population 
conditions and other problem specifications required by SUMT. The next 
portion of output, labeled SUMT VERSION 4, shows the initial parameters 
and starting values for SUMT and then skips to POINT 194 which 
represents the final solution point. The estimate (the current value 
of x) with the cost function (F) and its lower bound (G) as well as 
second and first order estimates of the solution are given. Then the 
LaGrange multipliers, the partials of the objective function (F) with 
respect to both the and the constraints are provided. My lander et al 
(1971) provide detailed descriptions of the SUMT outputs. Finally 
SUBROUTINE SUMRIZE shows the sample size allocation by strata and 
summarizes other model inputs. 

A measure of the relative tolerance for the solution estimate may 
be obtained by evaluating (F-G)/F. Also the LaGrange multipliers for the 
constraint functions may be utilized to evaluate the relative sensitivity 
of the solution to each of the different constraints. 

In this example constraints 53 and 57 which require that (n*) (n) i 3 
for strata two and six, and constraints 18 and 22, which require that 
n'> n have the largest values. The final sample sizes confirm that these 
constraints are limiting the solution size for strata 2 (class 5) since 
(n*) (n) ~ 3,01 and (n') = (n) = 2,762, A similar situation holds for 
strata 6 (class 10) . 

The implication from all this is that the stratification procedure 
could be improved since there are several very large strata and several 
which are very small. A better scheme would be one in which the sizes 
of strata are more equal. Possible remedies include different 
stratification methods in the planning model or a review of the population 
classification results on which the population model is based. 

The possibility of several stratification alternatives always raises 
the question: should stratification be made to reduce variance or to provide 

for estimates of particular subpopulations? The answer is usually that they 
arc both important. A compromise procedure would be to use str;.'tification 
to reduce variance as much as possible, and then use "domain", or 
subpopulation, estimation procedures (Durbin, 1956; Cochran, 1963; Singh 
and Tessier, 1976) to obtain estimates for those subpopulations which are 
important for management purposes. Ifliile error terms associated with 
domain estimators are generally larger since there is no control on sample 
size, similar problems occur with stratified sampling since sample sizes 
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in each strata are generally small. In addition the number of estimates 
of subpopulations often includes many more categories than could be 
reasonably incorporated in a single stratification scheme. For instance, 
in inventories for timber management, estimates are frequently desired 
by site, by administrative units, by stand conditions, by species 
groups, by size classes, by soil type and, perhaps others as well. 


3.4 Planning for a Level II Inventory 

According to the statement of work, level II areas are 10-200 acre 
parcels which are of particular interest, presumably for management 
treatments. The precision desires are specified as plus or minus ten per 
cent at the 95 per cent level of probability for timber volume 
information. Populations of this small size are significantly different 
than level I areas (50,000 acres and larger) since not only are we dealing 
with a finite population but a small finite population. In addition these 
precision limits are quite high and the immediate implication is that a 
large amount of sampling will be required. Assuming a level I inventory has 
occurred and that Landsat classification results are available, the 
planning model would be useful as an initial indicator of the intensity 
of sampling required. The problem with the planning model is that 
simulations for populations this small would have to be replicated to 
overcome the variabilities associated with such a small population size. 

In addition different kinds of techniques , especially ratio and regression 

sampling, would have to he incorporated with very small cluster sizes and the 
option of no clustering would also need to he evaluated. The types of 
auxiliary data available and their relationships to volume data would 
need to be examined. If no ground sampling effort was allowed there would 
be potential bias in the procedures which could not be evaluated. Using 
ground sampling procedures and large scale photography for auxiliary 
data with a ratio or regression technique would still require fairly large 
sample sizes for this high precision level. Stratification of these small 
populations might be possible using medium or small scale imagery, but 
Landsat classification would likely be prohibited because of problems in 
locating precisely small sampling units. IVhile this possible application 
of the planning model would be a promising avenue for research activities 
it may be more profitable to concentrate on gaining experience and 
expanding the planning model to somewhat larger populations. 
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4.0 CONCLUSION 

4.1 Sa m Houston Nationa l Fore st Inventory 

Results of this inventory application have shown that while Landsat 
data were not useful because of extreme homogeneity of topographic and 
vegetative conditions, remote sensing data from large scale photographs 
did provide timely and cost-effective estimates of the kinds of parameters 
currently required by the Forest Survey. Tlie e.stiraated relative errors 
for volume and growth were near the desired Forest Service levels. 

Mortality and removals estimates had larger error, but the Forest Survey 
does not state a desired precision level for these estimates. By 
considering some of the opportunit ie.s for improvement which are outlined 
in Section 2.5 and with appropriate training for field crews and photo 
interpreters, it should be possible to expand and improve the system 
beyond the demonstration phase to allow estimation of all the characteristics 
of interest to the Forest Service. 

4.2 Sample Survey Planning Model 

The planning model developed here represents a formalization, 
of the interrelationships which must be considered in planning sample 
surveys for multivariate applications. As with any model, assumptions are 
required in order to simplify the '*real world" complexities so that a 
workable model can be formulated. But even with assumptions, some of which 
require additional research to evaluate their importance, the planning 
model provides an important mechanism for evaluating the various factors, data 
and components of a sample survey system. Without this formal planning 
mechani.sm, survey design is largely relegated to experience, hunches and 
guesses. This is especially true when large multi-purpose surveys are 
contemplated. 

A significant requirement for the successful implementation of a planning 
model of this kind is the availability, in a useable form, of considerable 
prior infoTTTiation about the population to be surveyed. A "data base management 
system" of the type referred to in the introduction, is probably the only 
way to insure that this kind of information is accumulated and maintained over 
time. If this level of information is available and its quality is improved 
as time passes (and more data are acquired), the planning effort becomes 
more effective and the efficiency of the sampling effort should increase. Tin- 
Landsat population model used in the Quincy Ranger District study meets this 
requirement and represents a unique application of a remote sensing data .source 
to planning for sample surveys. 

4.2.1 Quincy Ranger District Application 

In terms of the case study application to the Quincy Rangei' District 
specific conclusions arc possible, keeping in mind that extrapolating beyond 
tlic limits of this example is not wltlunit risk. 'I'he only trend which was 
con.si stent over all levels of other factor.s was that I'i'S sampling witii larger 




PSU sizes produced consistently lower cost than equal probability sampling. 

The cost function and precision limits used were the most important factors 
in determining which sampling alternative was least costly. 

More effort needs to be devoted to defining the true sampling costs, 
both the level of costs and their functional relationships to sampling 
activities. Following these factors in importance \tfere sampling strategy 
and selection probability, but they also interact with cost and precision. 

The optimum solution was relatively insensitive to the random number 
sequence used for simulation, PSU size, and correlation , at least in the range 
of values considered in this application. 

Effects on the optimum design of changes in the population represen- 
tation and relationships among the parameters were not examined. Either 
or both could have potentially significant impacts on actual results and 
also affect the sensitivities of the various other factors influencing 
the ranking of alternatives. 

In order to select an optimum system for the Quincy Ranger District 
certain model parameter's must be specified in advance. Since this 
application was basically experimental the following parameters were 
assumed to be the most realistic: 

1) probability level for confidence statements at 95 per cent 

2) allowable error at 10 per cent for each parameter 

3) correlation at 0.8 for the double sampling alternative 

4) cost factors as shown in level 4 of table 7 

With these parameters and the population representation, the least-cost 
system, among the alternatives considered, was stratified two stage with 
double sampling with PSU size at 60 x 6 pixels and selection probabilities 
proportional to size, 

4.2.2 Implications for Management and Survey Planning 

Survey design requires interaction between the resource managers and 
survey planners to arrive at a cost effective sampling system which 
meets the managers objectives. The planning model provides a formal 
mechanism for linking the two sides with each of the key links between 
the two Well defined. With the decision model specified on one side and 
the planning model on the other, the chances of successfully designing 
a sample survey to meet the decision-maker's needs are substantially 
greater, especially as the decision process and information requirements 
become more complex. 


In management applications where a cell -by-cell population model, such 
as the Landsat model developed in this study, is available, the planning 
model permits evaluation of the effects of a number of StUmpling factors 
as they interact in a particular sampling context. Variations in both 
size and shape of clusters used as PSU*s can be examined. Both equal and 
PPf sample selection probabilities may be used. Any level of precision, 
both allowable errors and probability level, may be specified, and any 
of three sampling strategies may be employed. In addition, the interactions 
among all these factors may be evaluated. Finally, co.'^t coefficients and 
their functional relationships in determining system cost are quite 
flexible since coefficients for each term are specified as model inputs. 

IVliile the planning model will not solve all design problems, its modular 
nature and use of a detailed population model provides the opportunity for 
incorporating additional capabilities, assumptions, and factors which 
are, or may become, important in the planning process. 

Finally, it should be obvious that this approach to planning for survey 
design is not limited to wildland applications or more generally even to 
geographic populations. The general approach to planning and design extends 
to all areas of sampling work. Differences in population structure 
and complexity would lead to different approaches to various components 
of the planning model. However, geographic populations offer many applica- 
tions in different subject areas including agriculture, range management, 
forestry, regional planning, and even urban planning. The key to 
application in any of these subject areas is the availability of a sufficiently 
detailed data base. 

4.2,3 Model Limitations 

Several model limitations exist, but most could be reduced or 
eliminated with further development work. The current planning model 
requires a population model based on a land use class assignment for each 
population element and supplementary parameter data for each cla.ss. This 
requirement is based on a need to be able to generate populatioii 
variability components under different sampling designs. In some cases it 
may be possible to obtain estimates of variance components from prior 
surveys with subjective modifications to make them applicable to particular 
populations. Tl^en, specific sampling systems could be evaluated to 
determine the best allocation of effort, liowever, evaluating and comparing 
several alternafives would be difficult unless variances could be expressed 
as functions of the factors which would differ among the alternative 
schemes . 

The planning model only allows three basic sampling strategies, one 
of which is impractical for most large geographic populations. In 
future studies with this and other population models, additional alternative 
sampling techniques should be investigated and incorporated. 
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The cost of evaluating an alternative with the planning iuodel is 
subject to rapidly increasing cost as the complexity of the sampling 
strategy increases. Finally the comparison among alternatives is based 
strictly on the variable costs associated with this population model . 
Incorporation of other factors in the decision process is left to the 
docision-makers . 

4.2.4 Recommendations 

Opportunities for improving the utility of the planning model can 
be viewed as 1) refinements, 2) additional capabilities, and 3) further 
application and testing. Refinements would include Ij investigating the 
possibility that more efficient or improved nonlinear allocation 
algorithms are available; 2) individualizing the formulations of inputs 
for each different parameter (for example, allowing variable correlation 
for each parameter and stratum) and 3) considering possibilities for 
increasing the number of variables in the parameter set which can be 
incorporated into the model . 

Additional capabilities might be desirable such as 1) the ability 
to stratify by different methods, 2) additional sampling strategies, and 
3) the ability to work with different population representations (such as 
aircraft multispectral data). 

Further application and testing is desirable to 1) make the software 
package more user oriented, 2) apply the model to other types of problems, 
and 3) evaluate the sensitivity of the model to varying population model 
and parameter set relationships. 

4.2.5 The Future 

As indicated in the introduction, the trend in management seems to 
be toward much more detailed evaluation of management alternatives and 
the consequences of management actions . Associated with this analysis are 
increasingly complex decision models. More detailed Information is a key 
element in the success of these activities, and in the case of wildlands 
most information is obtained from sample surveys. With more and more 
information on hand in the information system data base, sample surveys 
need to be and can be more efficient, in the sense that maximum advantage 
can be taken of available information . Further, improvements are possible 
in both the planning phase and the implementation phase of sample surveys. 
The multivariate survey planning model described here represents a 
significant first step towards taking advantage of available information 
for the design and implementation of wildland sample surveys. 


- 71 - 


REFERENCES 


Arvanitis, L.G. 5 B. Alfonza^ 1971. Use of the generalized variance 

and the gradient projection method in multivariate stratified sampling. 
B:ometrics 1971 27:119-127, 

Arvanitis, L.G. § W.G. O'Regan, 1967. Computer simulation and economic 
efficiency in forest sampling. Hilgardia 38(2) :133-164 . 

Baker, R.D., 1975. Personal communication. Department of Forest Science, 
School of Natural Biosciences, College of Agriculture, Texas A§M 
University, College Station, Texas 77843. 

Bonner, G.M., 1972. Forest sampling and inventories: a bibliography. 

Canada FMI internal report FMR-24, April 1972. Multistage refs: 

98, 140, 141. 

Chakravarti, I.M., 1955. On the problem of planning a multistage survey 
for multiple correlated characters. Sankhya 14; 21 1-216 , 

Chatterjee, S., 1968. Multivariate stratified surveys. Journal American 
Statistical Association 63:530-534. 

Cochran, W.G., 1963. Sampling techniques, 2nd edition. John Wiley 8 Sons, 
Inc. New York, 413 pp. 

Colwell, R.N. et al., 1974. ERTS-1 data as an aid to wildland resource 
management in Northern California. A report of work done by scientists 
at the University of California, Berkeley, under NASA contract no. 

NAS 5-21827, Space Sciences Laboratory, University of California, 
Berkeley. 472 pp, 

Dalenius, T., 1953. The multivariate sampling problem. Skard Aktmar- 
Tidskr 36:92-102. 

Dalenius, T. Sampling in Sweden. Stockholm: Almquist and Wikrell. 

(1) symposia of sample survey methods and theories; (8) determining 
the optimum number of strata; (9) multiparametric stratified sampling; 
(10) the problem of regional statistics (design for total and 
domains of interest) 

Dell, T.R. § J.L. Clutter, 1972 . Ranked set sampling theory with order 
statistics background. Biometrics 28:545,555. 

Duerr, W.A., D.E. Teeguarden , S. Guttenberg, N.B. Christiansen, 1975. 

Forest resource management, decision-making principles and cases, 

2 volumes. Oregon State Univer.sity Book Stores, Inc., Corvallis, 

Oregon . 

Durbin, 1956. Sampling theory for estimate.s ba.>cd on fewer individuals th.m 
the numl)cr .^elected. Bulletin of tlic International Statistical Institute 
36:113-119. 


I- 


1 


i 


Ek, A»R., 1971. A comparison of some estimates in forest sampling. 

Forest Science 17C1):2-13. 

Fiacco, A.V. § G.P, McCormick, 1964. The sequential unconstrained mini- 
mization technique for non-linear programming, a primal, dual 
method. Management Science 10:360-366. 

Fiacco, A.V. § G.P, McCormick, 1968. Non-linear programming: sequential 
unconstrained minimization techniques. Wiley 6 Sons, New York. 210 pp . 

Ghosh, S.P., 19S8. A note on stratified random sampling with multiple 
characters. Bulletin Calcutta Statistical Association 8:81-90. 

Hajek, J., 1957. Some contributions to the theory of probability sampling. 
Bulletin of International Statistical Institute 36(3) :127-133. 

Hajek, J., 1964. Asymptotic theory of reject! ve sampling with varying 
probabilities from a finite population. Annual Mathematical Statis- 
tics 35:1491-1523. 

Hansen, M.H., W.N. Murwitz, § W.G. Madow, 1953. Sample survey methods 
and theory. Volume I, Methods and Applications, John Wiley 8 Sons, 

Inc . , New York . 638 pp . 

Hartley, H.O. 8 k.R. Hocking, 1963. Convex programming by tangential 
approximation. Management Science 9:600-612. 

Hartley, H.O., 1965. Multiple purpose optimum allocation in stratified 
random sampling. Proceedings American Statistical Association, 1965. 
Social Sciences Section 258-261. 

Hazard, J.W., 1974. Design of successive forest inventories: optimization 
by convex mathematical programming. Forest Science 20(2) :117~127. 

Millier, F.S. § G.J. Lieberraan, 1967. Introduction to operations research. 
Holden, Day Inc., San Francisco. 639 pp. 

Huddleston, H.F., P.L. Clayp.ol, 5 R.R. Hocking, 1970. Optimal allocation 
to strata using convex programming. Applied Statistics 19:273-278. 

Hyde, W.F., 1976, Resources planning act: critique and alternative 
approach. Journal of Forestry 74(5) :282-284 , 

Kendall, M.G, § A. Stuart, 1968, The advanced theory of statistic?, 
volume 3, Design and Analysis and Time Series. Hafner Publishing 
Co., New York, 557 pp. 

Kokan, A.R., 1963, Optimum allocation in multivariate surveys. J.R. 
Statistical Society, Series A 126:557-565. 

Kokan, A.R. § S. Khan, 1967. Optimum allocation in multivariate surveys: 
an analytical solution. J.R. Statistical Society, Series B 29:11,5-125. 


-73- 


Lahiri, D-B., 1965. Multisubject sample survey system - some thoughts 
based on Indian experience in contributions to statistics. 

Presented to Professor P.C. Mahalanobis on the occasion of his 
7Gth birthday, Perganion Press, pp, 175-220, 

Langley, P.G., 1969. Multistage sampling of forest resources by using 
space photography - an Apollo 9 case study. Second Annual Earth 
Rc-c jurces Aircraft Program Status Review, Volume 2. NASA Manned 
Spacecraft Center, Houston, Texas. 28 pp. 

Laiiglt*> ?.C,, 1969. New multistage sampling techniques using space and 
aircraft imagery for forest inventory. Proceedings Sixth International 
Symposium on Remote Sensing of Environment, University of Michigan, 
pp. 1179-1192. 

Langley, P.G., 1971. Multistage sampling of Earth resources with aerial 
and space photography. Monitoring Earth Resources from Aircraft and 
Spacecraft. NASA SP-275. CScientific a.id Technical Information Office) 

Leopold, L.B,, F.E. Clarke, B.B. Hanshaw, § J.R. Balskey, 1971. A 

procedure for evaluating environmental impact. USGS Circular 645. 

Mahalanobis, P.C., 1952. Some aspects of the design of sample surveys. 
Sankhya 1 2 1 1- 7 . 

Miller, R.G., 1966. Simultaneous statistical inference, McGxaw-Hill 
Book Co.j New York. 272 pp. 

Morrison, D.F., 1967. Simultaneous statistical inference. McGraw-Hill 
Book Co., New York, 338 pp. 

Murthy, M.N., 1967. Sampling theory and methods. Statistical Publishing 
Society, 204/1 Barrockpore Truck Road, Calcutta 35, India, 

Mylander, W.C., R.L. Holmes, ^ G.P. McCormick, 1971. A guide to 

SUMT-Version 4: the computer program implementing the sequential 

unconstrainted minimization technique for nonlinear programming. 
Research Analysis Corporation, McLean, Virginia. 215 pp. 

NASA Goddard Space Flight Center, 1971. Earth resources technology 
satellite data users handbook. NASA Document Number 71 SD 4249. 

Naylor, T.W. , J.L. Balintfy, D.S. Burdick, K. Chu, 1966. Computer 
simulation techniques. John Wiley S Sons, Inc., New York, 352 pp. 

Raj, D,, 1968. Sampling theory. McGraw-Hill Book Co., New York, 302 pp. 

Rao, J.N.K., 1966. On the relative efficiency of some estimators in 
PPS sampling for multiple characteristics . Sankhya A28:61-70. 


- 74 - 


Remote Sensing Research Program, 1976. Mapit users handbook. Internal 
software documentation. University of California, Berkeley. 

Sampford, M.R., 1962. An introduction to sampling theory. Oliver 5 
Boyd, London. 292 pp. 

Schneeberger, H., 1971. Optimum stratification and allocation in the 

theory of sampling - in German. Unternehmens Jorschung 1971, 15;24Q-254 
(Abstract found in Statistical Theory and Method Abstract 1972(13) :1468 

Schreuder, H.T., J. Sedransk, K.D. Ware, 8 D. Hamilton, 1968. 3-P 
sampling and some alternatives, 1. Forest Science 14:429-454, 

Schreuder, H.T., J. Sedransk, K.D. Ware, 8 D. Hamilton, 1971. 3-P 

sampling and some alternatives, II. Forest Science 17: 103-118 , 

Sedransk, J., 1967. Designing some multifactor analytical studies. Journal 
of American Statistical Association 62:1121-39. 


Singh, M.P. 8 R. Tessier, 1976. Some estimators for domain totals. 
Journal of American Statistical Association 71 (354) :322-32S, 

Stuart, A., 1963. Some remarks on sampling with unequal probabilities. 
Statistical Methodology 4:773-780. 

Tikkiwal , D.B., 1960. On the theory of classical regression and double 
sampling estimation. Journal of the Royal Statistical Society 
Series B, 22:131-138. 


Titus, S.J., M.J. Gialdini § J.D. Nichols, 1975. A total timber 
resource inventory based upon manual and automated analysis of 
Landsat-1 and supporting aircraft data using stratified multi- 
stage sampling techniques . ^ Tenth International Symposium on 
Remote Sensing of Environment, 6-10 October 1975, Ann Arbor, Michigan. 

Tomlinson, R.F., 1972. Geographic data handling. Symposium Edition, 

2 vols. IGU Commission of Geographical Data Sensing, UNHSCO/IGU 
Second Symposium on Geographical Information Systems, Ottawa. 


I 


J 


1 




I - I 




i 


) 




APPENDS ICES 


I 


» 


t 


t 


A. Sam Houston National Forest Inventory description 

B. Sam Houston National Forest Inventory Estimates 

C. Survey Planning Model Version 1, Summary 

D. Programming Formulations for Sampling Strategies 
1. Plumas National Forest Cost Data 

F. Mean -Covariance Summaries for Classification of Plumas NF 

G. Sampling Simulation Results for Two Populations 

M. Quincy Ranger District Planning Model Application Results 

I. Planning Model, Input and Solution Output 

J. Implementation plan for the optimal Quincy Ranger District 
sampling system. 


t 






$ A-1 


t - - I 


I 




» 


APPENDIX A 

Sain Houston National Forest Inventory Description 
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1. Population 

1.1 Entire Sam Houston National Forest included 

1.2 Units are circular plots of .4 acre size 

2. Parameters - totals of variables of interest within plot 

3. Sampling technique and estimators 

3.1 Sampling Technique 

— Sets of (X,y) coordinates chosen at random with replacement 
Flight lines fixed by sets of coordinates 

-- Plots located along flight lines 

-- Stratum (USFS type condition class) of each plot identified 

— Simple random sample without replaGeroent of plots within each 
stratum for photo interpretation 

— Auxiliary variable measured on photo for this set of plots, 

-- Subset of these plots chosen within each stratum by simple 
random sampling without replacement; variable of interest 
measured for these units on ground 


3.2 


Parameter estimator 

= photo measurement for plot j of flight line i in stratum h 

hig 

^hij ” gtound measurement for plot j of flight line i in stratum h 

R = where summations, are over all plots for which both 

^^hij photo and ground measurements exist 


Xj^ = average photo measurement for stratum h 
= stratum h estimat ■ = Xj^ R 
L = number of strata 

= area expansion factor for stratum h (stratum size/average 
plot size) 

y = parameter estimator = 2) 

h 


I 
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3.3 Variance estimator 


V(X3 = variance of X for photo plots matching ground plots 

V(X) = variance of X for all photo plots 

V(y) = variance of y for all ground plots 

e(X,y) = covariance of X and y 

= number of photo plots in stratum 
n^ = number of ground plots in stratum 

A 

V(yj^} = stratum h variance estimator 

Vf7 -) -r VCy] + VCX) - 2R C(Xiy) , 2R C(X,y) - V(X) , g 

n., n. 


vdiere S = 1 + t 


vcx) 


(r\i75q) 2 

and where the sununation is over all photo plots 
V(y) = over-all variance estimator 

vcy) = I Nu vcyu) 

H 


4. Sample size and allocation 

4.1 30 flight lines were flown over the forest 

4.2 264 photo plots were acquired by post stratification with approximatrly 
proportional allocation among strata 

4.3 29 field plots were selected proportionally to stratum area with 
a rainiraunt of 1 plot per strata. 


5. MEASUREMENT PROCEDURES 

5. 1 Preliminary Testing 

5.1.1 Photo Acquisition and Field Work 

On March 14, 1975, twelve flight lines of photography were acquired 
over the Sam Houston National Forest. These lines contained photography 
at two different scales, 1:4000 (wide angle) and 1:1000 (on a 3 x 5 print). 
The photography was acquired using two SSnmi cameras. The wide angle 
photography was acquired using a manual advance Nikkormat with a 50mm lens. 
Forward lap of the wide angle photos was approximately 60% and these photos 
were used primarily for location of the flight lines on maps and navigation 
in the field. The larger scale photos, 1:1000, were acquired as stereo 
triplets located at the centers of the wide angle photos. These stereo 
triplets were acquired using a motor drive Nikon F and a 200fiun lens. The 
forward lap of the triplet photos was 70 - 80%. Tne flight lines were 
located so as to adequately image vegetation types found on the forest 
and to be easily accessed for ground work. 

Ground data collection con.sisted of locaiting the plot centers of 
eleven stereo triplets and identifying tree and brush individuals which 
were apparent on both the photo and the ground. The individuals were 
located on the center photo of the triplet, pinpricked and numbered. The 
individual was identified to species when possible, and when appropriate 
the diameter (DBH) and height were measured and recorded. Diameters were 
measured with a standard tape and heights were measured with a Sunto 
clinometer. The diameter and height information was to be used in defining 
photo volume estimation procedures and the species identification was 
used to train interpreters for the forest inventory photo interpretation. 

Field work was completed in four days by one man. Forest Service 
personnel from the Raven District office provided excellent field support 
by making a man available for field orientation and aiding in species 
identificatioo. An important consideration became evident during the last 
four days of field work and that was the problems of determining ground 
location fro . the photos for navigation to specific plots and establishing, 
with a high degree of accuracy, the position of those plots. The problem 
is caused by the uniformity of the terrain and the dense forest canopy. 

As a result, a majority of the training plots were chosen because of 
proximity to well defined landmarks. 

5.1.2 Photo - Ground Relationships 

Extreme uniformity and density of vegetation made individual tree 
measurements difficult or impossible. Instead average photo variables 
for a plot were related to ground characteristics. Average difference 
of parallax, a proxy for height, was rela"^ed to average plot DBH for the 
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eleven test plots and interpretations were made by three different 
interpreters. Correlations obtained were 0.838, 0.876, and 0.769. With 
these results and the well established high correlation between diameter 
and volume it was concluded that difference in parallax should be 
reasonably well related to stand volume. 

5.2 Photo Interpretation 

5.2.1 Photo Acquisition 

The forest wide volume estimation procedure utilized 30 flight lines 
of photography which were located at random over the Sam Houston National 
Forest. The photos were acquired with the same specification discussed 
in the previous section. The photos were acquired in December, 1975. 

The photos were labeled by flight line, circular plot boundaries for 4/10 
acre plots were inked onto the middle photo of each stereo triplet and 
the photo plot centers located on Forest Service maps and photo mosaics. 

The photos were printed in 3 x 5 format for both scales of photography, 

5.2.2 Photo Interpretation Procedures 

Two interpreters analyzed 264 large scale photos for a multiple 
of variables. All of the interpretation was made using an Abrams Stereoscope 
One interpreter viewed the entire set and recorded for each plot locational 
data. Forest Service condition class, numbers of trees in the plot and 
hardwoods, canopy percentage for both hardwood and conifers and the number 
of snags present. 

The second interpreter measured parallax difference. Where 
possible five trees representative of conditions in the plot were chosen. 

Only conifers werh measured. Three base to top parallax measurements 
were made for each tree using a Wild mirror stere<^scope and parallax bar. 
Thirty to forty percent of the plots had three or only one tree measured 
due to extremely dense canopies, deep shadowing in openings which were 
present or very sparse tree canopy. The parallax difference measurements 
were accumulated and the average determined for each plot. 

5.2 Field Work * 

After the photo plots were located on the Forest Service maps it 
was determined for each plot whether it was on Forest Service land and 
what type condition class in which it lay. The number of ground sample 
plots chosen in each type condition class was determined by the relative 
area of the class in relation to the total forest area. Table 1 shows the 
number of plots chosen for ground sampling in each type condition class. 

The specific plots were chosen at random without replacement from the set 
of photo plots falling in the type condition class. The large scale 
photos of the specific plots were duplicated for use by the field crew. 

In the field the crew of two men used NASA supplied high altitude photo- 
graphy (I : 60,000) for general navigation. In the area of the plot the 
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the wide angle coverage was used to navigate to the specific plot area 
and finally the center point of the circular plot was located by inter- 
preting the large scale stereo triplet. The crew measured the actual 
ground distance of the radius of the inked circular plot boundary to 
determine exact scale of the photos. Within the plot boundaries all 
conifer and hardwood trees 5*' and greater in diameter were measured. 

Data recorded for the epnifer trees included DBH, length of last five 
years growth, bark thickness, and Forest Service tree class. For the 
hardwoods, only DBH was recorded. In addition five dominant or 
codominant trees were selected and marked on the photos. Data for these 
trees included DBH, total age and height. Recent cutting was evaluated 
by recording stump diameters and the type of removal, thinning or harvest 
Type-condition of the plot was recorded and compared with Forest Service 
designation. Travel times and time on plot were recorded. The ground 
data collection effort for a total of 29 plots required 54 man days for 
completion. 


Table A-1. Number Of ground plots sampled in each Forest Service 

designated type condition classes. Number of plots sampled 
was determined by relative areas of the various classes. 
Type condition class codes are consistent with Compartment 
Prescription Handbook, USFS-FSH 2409. 21d. 


Type Condition Class 
3101 

3106 and 3110 

3111 

3112 

3113 and 3114 
3200 

46 

Hardwoods 


Number of Field Plots 

1 

5 

1 

14 

1 

5 

1 

1 
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APPENDIX B 

SHNF Estimates Based on 1976 Survey 
Notes for the Tables 

1) All estimates are in thousands of units 

2) Diameter class is inches at hroast height 

3) Relative error is the standard error of ah estimate 
divided by the estimate 
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Table 11. 


Volume of Timber on Commercial Forest Land, by Class qf Timber 
and Softwoods and Hardwoods, SHNF, 1976, 


rtNu EStHATE 

Timber Class 

■^otal code q 5 rone m cooe 20 cnne so cgbe . 

• •»•••«•« •OSS Bftoe » o«»« bo*** »«##B*#B***B**^**B*BB***««*«# 04 *< o a» r «e a«*BB«B*** 

SCM^COi- 0-^22c t n.SitP 4 0*2i/p t> 5 OtjPOE 4 7i3|, 

KAwExCOl :.6i]S 5 669. C.1R3P' 5 0*.?llE 5 0.5i3E 4 o„578P 4 

«>ioaeaa««»Boi:<t*o»*oea*so*c«s»««aB«aa'0*oo ft •«oo ••••bubboob o««««BaaB««*»*B9«4# »••«•••••• |ia**a ••«*»B*****’ ****** ' ** e ••**•*•.*• 

TC1*1, c-sass <> Oi^nE 4 o.zjifp 6 u*l2flE 6 0,693£ 4 o«^49P 4 


REUaTIvE error 


"OTaL COES P5 COHE 10 CODE ?P COOE 30 COOE 4e 

B«* I B a B* «»*« « BBC BBS *«* »«*e 4 vB Be B«B«C a-B« B > BO* a BCB*B*BeBe*Be««*»*«*««:B»*S'B*«B«**B«****»«***e *«*•*•*•*•*«••• a a ^ *c e BBBBBBBBB 

SCMwCqL 0.772E -1 O.S^Q 0.443P -1 O-US 0.465 0,4«5 

WAht^tOl- 0.303 0.4<J5 0.276 0.24 q 0.820 0,690 

** Bfi a* e* bbb « e • e« cB e««e *e aee e bbb bcb a e sbb o e bbb BBBBBBBBBB a *bb bbb bb * bBbbbbbbbAbbbbsbbbbVb^bBb aBBaBBaBaBBeeB e bbo j e« aBB**BB*BB 

TUAL C'.7SiE -1 0.742 0.7S»£ -1 (J.lO.fl 0.6?9 0,6o5 


Code Description [USFS definitions) 


S 

10 

20 

30 

40 


Mortality trees 
Desirable trees 
Acceptable trees 
Rough trees 
Rotten cull trees 
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Table 12* Volume of Growing Stock and Sawtimber on Cominereial 
forest land, by Ownership (all USFS) and Softwoods 
and Hardwoods , SHNF , 1976 


FIN»t estimate 

Diameter Class 



total 

5 TO tl 

11 

AND UP 





tmm* 



SCMwCOJ- 

0 .3?.3E 

6 0,672g 

5 

0 . 256p 

6 

H^rlCWCOt 

L’ .«llE 

5 0,276e 

5 

0.31<E 

•> 

• «•••••••:•• p««»e ••••••a «e*«« 

TCTAl 

3 .38<E 

6 0.9<i8E 

5 

3.2B7E 

6 






relative EpROR 


total. 

5 TO 11 

11 

AND UP 


SC» THCtJt 

3.766E 

*1 0.123 


0.96*P 

-I 

maFiCuCQL 

0^303 

0.15*9 


O.S.T^ 

* 








TCTAL 0.776E -1 0*9^2E -i 0*lfJ8 


Table 15. 


Volume of Girowing Stock on Commereial Forest Land, by Species 
and Diameter Classes, SHNF, 1976. 


RNal Fsthate 

Diameter Class 

total 7-9 9-,i ' 11-13 l3-l5 i5-i7 i/-;9 

®JfJ*?*^®***® ****** ‘*®!»“**’‘***«*“****»«**® »»*«*»**»**»*e»**««***fl****«»ii»**#«***e«*»**' 
SCfT^COL, 0.323E 6 5 q.2top •» 0*3'’OE 5 0*^9T,E 5 (J,&28P 5 ;.364e I OtSlA*; 

HAhl-COi G.oiiE 5 O'Tl’f 5 Q.9»i[|P < g.ftiiE 4 o,69o6 4 o.3f2= 4 r.,5n2F 4 o,32sF 

It d 4 f s e in- o tt a a a ^ ^ « 9 <t « e It 9 a sit«(tit »4*e 

TtTAl - . . - _ - _ , . - - 


19-21 2; “29 OVER 2« 

• •»•*•««(»!■ a 7 » I. e o t s •••■ a ••• 

S 0.27nE & 0*i>3^E a 0t'»'*3E 4 

4 i).. 237? 4 Q.tg3K 7 COCO 


CsBtti,c*BOa»it*»ao«eao ss.«tntB**otto(to««.o»c*****»*(H»*#*c***»**«*,6«*#fl<Ht***#»*B ,, c •« a a 9 • « e e iks******* 
D. 36 -ia 6 a,23ttP 5 o.3n8F 6 0*42iE 5 0.&59E S 0*565p 5 >t.404E 5 0 * 36 aF 5 0.294£ & g- 


0 • ig3K 

; « 9 3 a • 

g.64gE 


2 IJ. 443 E 4 


■-'CLATIVE E990R 


total '>-7 7-9 9-11 11-l3 l3-l5 l5*-l7 l/-,9 l9-2i 2l*29 OVER 29 

^r******^?® &■•-»« (tins oe **3 OB » 0 * 0 *O 9 *** 4 « *******•*•«•*•*•••*«*«•»»««•• O « « 0 «O 0 o ••.ooc o-oo O O o : 0 O-»O *•■•*•••.* 

Sut(.,cnl 0,7t6E -1 0.1/9 fl.iS2 g.ii? o.JP* 0*131 r».l77 0,26(? g.25A 0,423 It^l 

ha-lwCQL c. 3'03 0.1^1 0.l7b 0.217, Q.2fr^» 0.422 r;.439 0.'l4 0.879 i.g3 C„000 

• so«oB*Oooo«E»o*ooa«<n.o«OB«ooooo*ooflooooi.ooo»oo*o**#*,o*##,«:Oo#***,o*«i*,**o,»,»*#**,;,*„,,***„*o,aa,,*,*„a*«a*****,o* 
TC14L <;.776£ -1 n.i(j9 0.111 0.114 .158 .ll9 .1&7 .24o .257 .4 4 l,2l 
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T-:ble 14. Volume of Sawtimber on CommerGial Forest Lan4^ by Species and 
Diameter Classes « SHNF, 1976. 


riNjit fsti>«ate 


Diameter Class 



total 


11-13 


13-15 

l5-i7 


l7-l9 

l9-2l 

. Pl»23 


23-i5 

25-27 

27-2V 

OVER 29 

scMMcnt 

C *256E 

6 

i).4V0F 

S 

0.5?8f 

5 0*3546 

5 

0*3786 

5 0,27of 

5 r..224c 

• •4 

5 

#* 

0.23S6 

5 0j7««6 

4 0*rC3 

0»^*0E '* 

M<(lCVC0t 

0.31AE 

5 

0.4?UF 

4 

0.362F 

4 0*5c2E 

4 

0.3256 

4 0.2376 

4 .4646 

4 

0 • O O rv 

0* OOf) 


4 n.obn 


TCTit 

Q.2S7E 

6 

0»5S9f 

5 

0.561>e 

5 g*4046 

5 

0 . S69E 

5 0*2946. 

5 .(.27oP 

5 

0*2366 

5 O'TSAE 

4 0*5676 

4 0f4^CE 4 






RELATIVE E6R0R 




■ 




total 

lt-l3 

l3-t5 

l5-i7 

i7-i9 

l9-2i 

Pl^23 

23-?5 

25-27 

27-29 

OVER 29 

SCTTjCOL 

fj.964E *1 

0*lS6 

0*131 

0*177 

0*265 

0,258 

ii*578 

0 • 4*3s 

0*596 

0*000 

l,2l 

WAHC *!COt« 

0*534 

0.2*5 

0*422 

0*439 

0*7t4 

0,679 

t;.873 

0 • 0 Or 

0*0 (in 

1.26 

D.OOO 


O##****.0 6 ***0O *•*•*••**«**:* #*«#*:*•!»****,#***>, *J.(.**H**.«*,**.;,*,^**^**_,^>** 5, a, 5 ******* 

Ttt»L 

0*108 

O.l’B 

0*ll9 

0-l57 

0*249 

0*257 

"*5gi 

0.93s 

0*596 

1.26 

1,21 












- '■‘•/it';- ii !■■ V t^vii V i. - irk^Li. -i ' - 


B-7 


Table 16A and 18A. 10 Ycmt Volume Growth of Growing Stock and Sawtiinber 

on Coraraercial Forest Land, by Species, SHNF, 1976. 


FfN*i FSTIkaTE 


total, 5 to ii 11 AMD uo 

*iCFTuCt)C o.liie 5 n.S-lSF 4 0,796 p 4 

MARtwCnD n.oon n.nno o.ono 


TOTAI ft.lilfi 9 n,Xt9f! 4 0.796P 4 


RELiTTVjS EffROR 


, total', 5.T0 11 it AND uS,, . ... . j,. . . 

RCFTwCdC B, 6846 *1 0,?32 0l779p ■! 

HARCWCOC ,0.000 . 0»000 OjoiiO ,, , , 

tCTAl 0.6846 -1 0.i38 OlTT^F *1 • 


U U 0 


t.'3 





Table 16B and I8B. 10 Year Volume Removals o£ Growing Stock and Sawtimber 

on Gominercial Forest Land, by Species, SHNF, 1976. 


FTNA1 P^t1m4TE 



TOTAt' 

5 TO 11 

i{ Awn Up 

SCftwCdE ' 

fl.7956 

4 222. 

oil POP 4 

H*RCwC0ti 

322. 

222. 

D.dftB 

TCTAl 

O.SnTE 

4 443. 

o'.iOOB 4 


CO 

I 

» 


RFLiTIVi; EPadR 


T0T»i 5 Tn 11 11 A»n ub 


SCFTwCOt 0.569 1,13 ol775 

hahcwcoC i.iS . njoho 

TCT« 0.55l *I»766 *** otm 
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Table 2G. Mortality o£ Growing Stock and Sawtimber on Connnereial Forest 
Land, by Species, SMNF, 1976. 


final FST1«aTE 

Diameter Class 

total 5 To: 11 11 ANO UP _ . „ ' 

>4 no - ^o:»oo oooo ooooooo e o**o «••••« ••••e«o«o is »o •« o <-1 «• ••••••••••- 

StMrftat ■ n.567E 4 4?;1. 724, 

^iShEjr.m. ,H3. 34'tf. D.O'HO 

O'OOO'OO o«»o.o:o oo'o.oo oooo.it ••«»«« oooooc ooooooooooooooo ooo'ooo «••••#• oooao 0*000 •oooooooo o p 00 »« ono e « ooooooOoo 

TCIAL 1.602S 4 77P. 724. 


RELATIVE EP«OR 

total ' 5 . TO 11 11 AND UP . _ 

oaoo.oo 0*0 0 son oooooooooeooooBoooooooooooaBooooo ooooooooooooo’ooooo'oooooooooooo.ooooooooo ooooooooooo^oonoo* o« ■ 3 ooeoooo***** 

SLUVCOL O-T09 l.^a 1,2fl 

HAht,ICQU 1.4 q ^,40 O.DQO 

00 oo.**** OOOO OOOOO ••* 0*0 0 OOOOO'OPOOOOO ••*00 00 « 000000 « 000000000000 * 0 * 0 «** 0 ' 00 ** 000 * 00 * 0 * 0 ^*«««** 0 * 0 * 0 * 0 * 000 eoa* OOOOOOOOOOOOO 

TCTAL C/.744 0*9(J1 1.23 
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APPENDIX C 


Survey Planning Model Vetsion 1. Summary 


SPM is a coordinated set of computer programs written in FORTRAN IV 
and implemented on a CDC 7600 computer. 

It functions to carry out major tasks associated with components of 
the model control and allows execution of each program singly or in 
sequence . 


Four major programs are involved 

.1 SIMULATE reads data describing the population and simulates 

individual population elements. Results are saved for later use. 

.2 CLUSTER reads data produced by SIMULATE and prepares the 
population representation for sampling. This includes 
stratification, clustering into sampling units, definition of 
selection probabilities and computation of population variance 
components. 

.3 SUMT utilizes data output from CLUSTER as well as user specification 
of sampling strategy, cost coefficient, precision desires and 
other parameters, A nonlinear programming problem formulation 
is used to determine the rarapling allocation which will meet 
the precision constraints with the least cost. 

.4 COMPARE evaluates output from several alternative formulations 
of SUNfT and summarizes each beginning with the least cost 
alternative . 

A flow chart is chown in Figure C-1 to indicate interac :ions , inputs, 

and outputs of the model. 


DEFINE AND 
EXTRACT STUDY 
AREA . 


MAKE 

CLASSIFICATION 


SIMULATE 
MULTIVARIATE *- 
NORMAL 
POPULATION 


CLUSTERS 


STRATIFY 


GENERATE 

SELECTION 

PROBABILITIES 


COMPUTE 

VARIANCE 

COMPONENTS 


SELECT 

SAMPLING 


STRA' 


EGY 


INPUT ► ALLOCATE 

PROBLEM SAMPLING 

SPECIFICATIONS RESOURCES 

t 1 

NEU 

ALTERNATIVE? 

Jnq 

I YES ^ 

PROBLEM? 

|no 

COMPARE AND 
SUMMARIZE 


OBTAIN 

ESTIMATES OF 
PARAMETER SET 
FOR EACH CLASS 


INPUT 

ALTERNATIVE 

SPECIFICATIONS 


YES 


Figure C-1. Planning model flow chart 


APPENDIX D. 


Programming Formulations for Sampling Strategies 


Cost and variance functions as well as other constraints on sample 
size are included here for 1) stratified sampling, 2) stratified two 
stage sampling, and 3] stratified two stage with double sampling. First 
and second partial derivations and evaluated for both cost and variance 
functions. 


Notation, except where otherwise defined, follows that of Cochran, 


1963. 


1. Stratified Sampling Allocation 

Mtn . = 

Subject to Vp ) =1 sj (i . ^ V 
h«l h j h n 


Sir 


hr'^ "h 


+ „ -is „ , 

+ — sr- n, - 1 . 


a^TT 

-gp-y 

"h 

9^ 


4- h h = 1. 

4 


.L 




= 0 


3V{Yj) 
3 n. 


”h< 


n. 


h = l...L, j *l...m 


a^vc?.) 

0 2 " 
”h 


2 2 
■2N^S^ 
h hj 

3 

n,. 


h = I . . -L , j = 1. . .M 




= 0 


h = 1...L, j = 1...M 


h* 
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2 . Stratified Two Stage Sajatple Allpcatibn NLP Formulation 




Subject to 

(1) VC^^) <_Vj . 1...NPAR 

v(yp 

^ h»r " h ^ 1 


o - .-2 

Y .2 

3 X 


0.2 _ „ 2 2 ,J^ 

®hij ^n»j^ ■ ^ 


C 2 ] n^Mh 

(33 


maximum sample size 




(4) iijj >P-Oij 

> 0.01 

Minx 


minimum sample size 


’^l " '>2 V * Ss "h| * 

, .i: (,.2 „2 /i 1 \. "KVj 

Subject to V CYj) 


ax _ p f, Cii3 -?s 

^hl S 2 ®h 2 "h 


3x ^ p _ 

3l^ - ^2 "h 
h 


a^x "-h3 ^ -3/2 

"h 

z 

ax . « 


"h "h(- 


J = 1...NPAR 


0-3 




2. Stratified two Stage Saniple Allocation (contj 


ti.' 


.= 0 


anj^anij^ ” h2 


Bit 


= 0 


>2.,2 a 


OT . , VyM_ - i) 


Wh«;i 

■'hC 


ah. 


\m -2 »h4j ^ ^ 'i [J. 1) 
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5. Two Stage with 


1th Double Saatpling NLP Formulation 
(ignoring StratitlcaVibn) ^ 


* * rr * rr- 

Min IT = CjH + C2n n* + ** + n n + Cg^ n n 


Subject to 
(l3V(Y.):iV. j 


1 . . . NPAR (The number of parameters to be estimated) 
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vcv = 


C2) 

(3) 

C4) 






(assumes combined sampling for regression) 
(Reference: Tikkiwal 1960^p. 137) 


* 

n ^0,01 
n’^0.01 
n ^0.01 
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n * n^3 to assure sufficient degrees of freedom for variance 
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APPENDIX E 


Cost Figures - Plumas Irventory 1974 


I. Pre-photo/ground 

Tape acquisition (1021-18X63) 

Tape Reformatting 
Tape 3 tapes/date §7.75 each 

Computer Time 1 hour per date @$40/hour 

Operator hours @$5/hour 

Test Area Extraction 
Tape 3 tapes/date @7.75 each 

Computer 1 hour per date @40/hour 

Operator Ih hours per date @S/hour 

Delineation and extraction of stratification (initial) 

Photo reduction of map 

Digitizer with operator 2 hours @$43/hour 

LSR. fit of co-ordinates: Computer 6 runs @$1.80 

Operator 8 hours @$3. 00/hour 

Computer Mask Generation: Computer 

Operator 3 hours @$5/hour 

Delineation and Extraction of Administrative Boundaries 

Photo Reduction of Map 

Digitizer with Operator 8 hours @$43/hour 

LSR Fit of coordinates: Computer 9 runs $1.80/run 

Operator 10 hours $3/hour 

Conqjuter Mask Generation: Computer 

Operator Z hours @$5/hour 

Training of Classifier = 60 elasses/strip, 3 strips/date* 

Conq>uter Display terminal 24 hours 0$ 40/hour 
Image analysts 40 hours @$6/hour 
Statistical analysis; Con^uter (LBL $25.00) 

Operator 10 hours @$3/hour 

Selection of Channels § classes:, 20 hours §$5.00 


Single Date 

360.00 


23.25 

40.00 

7.50 

70.75 

23.25 

40.00 

7.50 

70.75 

15.75 

86.00 

10.80 

24.00 

136.55 

12.50 

15.00 

27.50 

15.75 

344.00 

16.20 

30.00 

406 . 00 

12.50 

15.00 

27.50 

960.00 

240.00 

25.00 

30.00 

1255.00 


100.00 
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Discriminant analysis run 

Single date: Computer (3 runs totaling^ 

Operator /analyst 24 hours @$7. 60/hour 


918.00 

182.00 


Generation and Selection of PSU’s 

Computer 3 hours @$40/hour 120.00 

Analyst 4 hours @$3.40/hour 14.00 


Location of PSU's (for aerial photography) 

Computer 10 hours @$40/hour 400.00 

Analyst 80 hours @$5/hour 400.00 


Sub-total 

II . Aerial Photography/Interpretation 


Photo acquisition 

Aircraft 25 hours @$32/hour 

Pilot 32 hours @$6/hour 

Photographer 32 hours 0$3.40/hour 

Film 60 rolls 0$3.60/roll 

Processing 60 rolls @$1.90/roll 

Printing Ave. 30 images per roll @$63/S'* x 7" 

print (1800 prints) 

Photo Interpretation 

Image analyst 360 hours @$3.40/hour 


800.00 

192.00 

109.00 

216.00 
114.00 

1134.00 


Sub-total II 


III. Ground Data Collection 

Travel 

Mileage 4590 @ 15^/mile 
Crew 

Wages 46 boys x 4 men = 184 man days (1472 hrs„ @ 3.86/hoar 5682.00 

Per diem @ $21/day 3800.00 

Sub-total III 


1100.00 


ii4.0G 


800 .00 
4488.00 


2565.00 

1244.00 

3789.00 


689.00 


9482.00 

LQ,171.0D 


IV. Data Summary and Map Generation 


Computer Analysis ground and photo data $ 280.00 

Combining ERTS and ground data 8 hours @$5.00/ 

hour 40.00 

Generation of summary statistics 16 hours 

@$5/hour 80.00 

Report prep^ation and reproduction 6 SO. 00 

Computer time 250 . 00 

Sub-total IV 1300.00 
Sub-total $19,748.00 

V. Administrative (27%) 5332.00 

VI. Overhead (30.2%. of I-V) 75 74.00 

total Cost $32,654.00 
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APPENDIX P 


Mean-covariance summaries for the classification of the Plumas NF. 


This appendix summarizes the basic population estimates for three 
parameters by vegetation type-condition class (tCC) category as follows 


TCC Description 


1 

True fir - 

reg^iieration 

2 

H _ 

immature 

3 

M 

mature 

4 

!! ^ 

overmature 

5 


poorly stocked 

6 

Mixed conifer - regeneration 

7 

ft 

- immature 

8 

tf 

- mature 

9 

It 

- overmature 

10 

ft 

- poorly stocked 

11 

East side pine - regneration 

12 

It 

- immature 

15 

tt 

- mature 

14 

II 

- ovesmature 

15 

It 

- poorly stocked 

16 

Hardwoods 


17 

nonforest 
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tiAUL ifrt/""Nc; ' mes — 

SU“ 4.53424E+02 3.#66366*’02 3.43350E*01 

HBtN l.l335tE+ci 9.t6id*JF*0l 9. 7Ca74E+00' 


VAR 8.91 6J 

fr5F+03 2.38( 

«00£+03 2,32Pj|59F*Cl 

tv e.Siy. 

k3E'^Cl 5.32'i 

I13e+01 i.SAl04E+0L 


NEW CV.-^e,..TCC 2 


Cfl^AftlJl'SCrWiTPTl! 

^ bMkl ’ XVrr NO. T Bf.ES BTCR 

1 2 3 

BASAL JLiltZi! 

NO, T^FES Z 2.34341F+03 2*386a0F+03 
-ffasi? 3 4.4dvu6E402 ' KsawK^oa 2.3zasg g?Pr 


CJaA:ELA T TC'i ~ PAT 7rr7 

SASAL AHfcA HU. T PbES BJfCF 

12 3 

BASAL AftEAl UfiCCiO 

NO> r^FES 2 •5091 l»0000 

3 TTFn IPfST l.UOUO 


SUBROUTTKE CCVMaT N ■ U ’ t* w % 



BASAL APfA 

NG, TREES 

OAGP. 

S»Jf< 

1.3«ie23EfC3 

9,40’ A (>F +02 

fl-42a23F+01 

MEAN 

6. 15430EtCl 

3.53145E^01 

4.957TOEtOO 

WAR 

6.17a8BE«03 

9.82957S+02 

1.G44 775*01 

CV 

9.e3942E*0l 

5, 66 7 9 OF +01 

6.51807F+G1 


COVARI/.SiC£ H4TF.IX 


BASAL AREA NC. TREES PlflCR 

1 2_ 3 

BASAL AREAL 6.170361^03 — 

NO, TREES 2 a.9235fef»C2 9.B7997r»fl? . 

P4GR 3 1,82102E»02 E«15a5oE-»01 U04427E+01 


COP F ELAT ICN 

KATRIX 

. 



BASAL AREA 
1 

NO* TREES 
•2 

BAGR 

3 

BASAL APEAl 
NC, T9FFS 2 

l.UCLC 

.1621 

l.CCOO 


SACK 3 

.7169 

.8053 

1.0000 
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StJa«tC»TTA£ CCVrtAT N » 21 H • 3 


SM«. . .. 

BASAt A?.6A «IC, T*»ECS 
1. 75810?*03 l.6Tai?e+03 

BACR 

1.21426E4-07 


♦'F&M 

VAP 

e.371S2£*0l 7.Mi28S*0l 
3.48ISi^E^C3 2..33400E+03 

5.78ZZ1E+00 

1.294BZE4>01 

new GV*"'...TCC 7 

cv 

7 . 04332fc «^0 1 6 .045 5 3£ « Cl 

6.22317f+01 



C0VAP.IAKCE MATRIX 




SUBRCOTIBE CCVHAT N - 70 M ■ 3 


BASAL AREA NC. TREES 
1 2 

BACR 
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• 

54$4L 4R£Al 
Kt). T?FCS 3 

3.4fil«;4e*03 

2.00T54C+C2 

2.33400E403 



basal area hit. rRELS 8AGP 
SUM 7. 52329E4C3 5.23012^+03 4.90954E+02 

BAGS 3 

1.441b 8c 402 

U51571E«02 U 

>29462E«01 


BFaB 1.074 }6E+02 7.54503tVoT^.^oi?63F-*o0 
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BASAl ARfAl 
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.7042 

1.0000 
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1 2 3 

BAG® . 3 
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BASAL AREAl 4.1952 
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NEW CY'' ’’...TCC 5 


•n 


SUM 

BASAt area NIC. rREES 
7.61380E+C3 3.75318E4-03 

BAGR 

4.57908E*02 





MEAN 

VAB 

1.312T2E+C2 6.4710UIE+01 
7.77afi5F*01 Z.45QieBE-»03 

7.87945E*-Q0 

A.E6474F*Cl 
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CCVHAT 

N ■ 7 H - 3 

cv 

6.t,53£5£«-Cl T.64Y56E+01 
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SUM 

QA&Al AKt'^ 
B.55269E+02 

Vc* IRfeeS l’.AG'» ■' 

4.76321E+02 6.19120F+01 

COVARIANCE 

MATRIX 


• 

MEAN 

V4P 
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APPENDIX 6 


Sampling Sliaulation Results for Two Populations 

Results of a sampling simulation using three variations of 
probability proportional to size sampling are summarized here for 
two small finite populations. Population one has 115 elements and 
population two has 12 elements . 


Notation 


N » population size 

R • Number of replications of the sampling procedure 


N 

i » 1 , 

*9 > estimate of Y, depends on sampling procedure and may be one of the following: 


A 

Y a 

PPSOR 


-•’t. 


where t^ = y^/pj^ 


k - 1 

and ^i * ^i 

j = 1 


k - 1 


n 


'PPS/WR 


..2 




2, . . . »n 


>2 i 

i » 1 Pi 


n 

* 1 V ^4 

PPS/WOR “ h Zj 

i * 1 Pi 


where PPSOR » Sampling with probability proportional to size for the remaining 
elements in the population (Raj. 1973, p. 57) 

PPS/WR » Sampling with probability proportional to size, with replacement 
PPS/WGR ■ Sampling with probability proportional to size, without 
replacement 

R 

^ " W 2 » where Y is one of the estimators given above 

j • 1 


MIN(Y) « min (Y^, j - 1,,,.,R) 

A A 

MAX(Y) - max (Y^, ) • 1, R) 


I 






J 


f . > 


PR0P±1 SE 


proportion of the R est lutes ^ which fall within ± 1 standard 
error of the estiute 


PR0P±2 SE X proportion of the R estiutes ^ which fall within t 2 standard error 

V(Y} “ population variance of depends on the san^ling procedure and nay be 
one of the following: 


^^^PPSGR^ “ ■ 

1 

a 

E 

i • 1 

■ c»-ii 


i 

B 

Y 

n (B-1) 

2-i 

i - 

A 

^^PPS/WOR^ 

" ^ ^PPS/«R 

) 


Etv(?)) - 4 2 

j » 1 ^ 



A ^2 

Y I 

*PPSOR‘' 




pps/im 


2 


MIN C^(Y)) « Bin (V(Y^). J » 

MAX (VCY)) » TOX j - 1*...»R] 


MINRSE • nin 



* J 


1 f • • • * R) 


MAXRSE > nax (RSE^ > j « 1,....R) 

CV (V(0(?)) - where VCY(Y)) - |=j ^ f ^ 

RUN » conputer run identifier 
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sampling without REPLACEHENT 
NtMPER or REPLICATES 
^EOO 

.«wMPER Of UNITS IM POPULATION 
llE 


Population #1 


TaMe 6>1. Population #1 data, 
population size is 
115 units 


t 9 auxiliary variable 

V • VARIABLE Of INTEHEST 
p « selection probability 


obtained fROM 


X 


UNIT IS 


1 

8 

2 

9 

3 

10 

4 

u 

5 

14 

6 

15 

7 

16 

8 

20 

9 

21 

So 

22 

It 

26 

t2 

27 

ts 

26 

14 

32 

15 

33 

16 

34 

17 

38 

18 

39 

19 

40 

KO 

44 

*1 

50 

32 

56 

:8s 

62 

34 

68 

35 

74 

36 

80 

37 

81 

38 

86 

39 

87 

90 

92 

91 

93 

92 

98 

99 

99 

34 

100 

95 

104 

36 

io5 

97 

106 

98 

102 

? 115 

52 

112 

S3 

U4 


^ ’ * * - 

55 417 
SO 118 

ST 120 

58 122 

50 12A 

51 126 

>2 128 
»3 129 

»A 480 

?5 182 

9,6 134 

V 135 

^8 136 

?9 1*0 
JO 141 


24.2000 

iiViieoo 

167.9000 
7,7000 

99.8000 

231.0000 

349.7000 

89. 6000 

278.7000 

398.0000 

97.2000 

908.0000 
287,2000 

132.9000 
333,1000 

188.0000 

108.8000 
49,6qO0 

33.6000 
69,1000 

42.2000 

42.2000 

42.2000 

24.0000 

18.6000 

65.2000 

33.0000 

198.8000 
169,9000 

233.8000 
194,3000 

354.0000 

362.0000 

164.0000 

460.0000 

442.0000 
l62,4f)00 

i9«2ooo 

iiiiftii 

107,5000 

136.0000 

432.0000 

610.0000 
121,0000 
154.3000 

422.0000 

710.0000 
138*6000 

150.0000 

145.2000 

732.0000 

130*2000 

17.9000 

109.2000 

734.0000 

133,4000 

137,8000 

94,0000 


17.2800 
53,7600 

86.4800 
5.7600 

36.4800 
101,7608 
149,7600 

63*3600 

132,4800 

153.6000 

67,2000 

192.0000 

138.2400 

97.9200 

l57,44ob 

111,3600 

115.2800 
46.0800 
23,o4QO 
90.2400 

61.4400 

61.4400 
61,4400 
38,4000 
30*7200 

44.1600 
19,2000 
76,8080 
7i,0400 

92.1600 

65,2800 
136.3209 
99.6400 
61, 4400 
l53.6aoO 
ll7,l2O0 
59,5200 
7.6800 


i 0 V 

53,7600 

32.6400 

195.5200 

182.4000 

74.8800 

63.3600 

157,4400 

1R6,24qO 

63.3600 

48.0000 

71.0400 

192.0000 
59.5200 

9.6000 

91.8AO0 

192.0000 

44,1600 

57,6080 

1 92.PCOO 


,0009 
,0046 
.0065 
.0083 
.0023 
,0090 
,0136 
,0033 
,0108 
,0i39 
.0038 
.0197 
.0111 
,0052 
*01 29 

fOOTS 
,0042 
.0019 
,0013 
.0027 
.0016 
,0016 
.0016 
.0009 
,OC07 
,00 25 
.0013 
,0077 
.0066 
.0091 
.0060 
, 01 37 

.0140 
.0071 
*01 TP 
.OlTl 
.0065 
*000T 


llV:tll8 ilil 


.0042 

.0053 

.0167 

.0236 

.0047 

.0060 

,0164 

.0275 

,0054 

.0C58 

.0056 

.0284 

.0C50 

,0007 

,0042 

,0285 

,0052 

.0053 

.0036. 


UNIT 10 

Si W 

02 i46 

S3 147 
i48 
£5 i49 

96 i52 

97 i53 

SB l54 
99 l55 

U 1?8 

U 159 
h 160 
*3 161 

74 i64 

75 i65 

76 166 

h i67 
it 170 
<9 l7i 
00 172 


81 

i73 

82 

1T6 

83 

l77 

84 

iTe 

85 

179 

86 

162 

87 

183 

80 

184 

89 

IBS 

90 

l89 

9l 

l90 

92 

194 


195 

94 

196 

V5 

200 

96 

2Qi 

97 

202 

90 

206 

99 

aoT 

lOO 

208 

1»1 

212 

102 

213 

103 

2l4 

104 

2l8 

1Q5 

2l9 

106 

220 

107 

*224 

108 

225 

1U9 

230 

HO 

231 

111 

236 

112 

237 

113 

242 

114 

243 

115 

249 


total 

AVERAGE 


^OOR 


C-S 


139.6000 

63i36Q0 

,0P54 

169.0000 

61,4400 

.0066 

728,0000 

192.0000 

.0282 

526,0000 

1 65.1200 

, 62 0 4 

29,4000 

17,2806 

• OMl 

145,8(100 

55.6000 

.0057 

750,0000 

192.0000 

,0294 

504.0000 

161. 2800 

.0195 

368.7000 

117,1200 

, 01 43 

382,7000 

161.2800 

,0148 

534.0000 

145,9200 

,O70T 

342,3000 

124.8006 

.0133 

3l3,6o00 

101,7600 

.0122 

562.0060 

192.0000 

1 02l8 

4i6,5o00 

134,4000 

.0161 

276,0000 

97.9200 

• 09 OB 

229,1000 

80.6400 

.0089 

378.0000 

192,0000 

.0147 

357.0000 

134.4000 

,0138 

154.1000 

74.0800 

,0060 

240.5000 

84.4800 

,0093 

352,0000 

126.7200 

,0136 

409.0000 

132,1800 

,0l59 

Sia.3000 

124,8000 

i0l23 

3.0000 

1.9200 

.0001 

122.8000 

72,9600 

,0048 

379,7000 

130.5600 

,0i47 

334.6000 

1|9,04QC 

,0130 

1A4.1D0O 

107,5200 

.007l 

IIO.ICOO 

53.7600 

,C043 

139.4000 

63.3600 

,0054 

54.6000 

57.6000 

.0021 

52.4COO 

26.6800 

• 0020 

78.1000 

40,3200 

.00 30 

64,8000 

5i,8400 

• 0025 

252.0000 

71,0400 

. 009P 

260.5000 

97.9200 

.0101 

112.8000 

76,8000 

, gr4* 

301,8000 

97,9200 

.011' 

178,2000 

67.2000 

,006; 

l69,4000 

99.6400 

.006( 

283,4000 

7l,0400 

.011 

39.6000 

19,2000 

. ,001' 

2}2,2000 

107.5200 

,008 

130,7000 

55.6600 

,005 

6.1000 

3.8400 

.000 

81,3000 

34,5600 

.003 

59.3000 

99,8400 

.002 

49.5000 

24.9600 

.001 

205.6000 

94.0800 

.008 

51.8000 

28.8000 

.002 

193.6000 

105,6000 

,007 

12.7000 

11,5200 

.000 

250.5600 

90.2400 

.0 09 

66,4000 

38.4000 

>002 

2»793,200oM^^9'5800 

ii.ooa 

224.2e07» 89.1381 

• 00c 


Population #2 


sampling kith RePLACGMENT 
NlHbgR or BEPtICfcTES 

ieouo 

NtHHER OF UNITS IN POPULATION 

U 


X a AUXILURT VARIABLE 

T a VARIABLE OF INTEREST 

P a SELECTION PROBABlLITT OBTAINED FROH X 


UNIT 10 X T P 


1 

PGE 

l-l6 

SB.AOOO 

^ 42.7098 

• 1298 

2 

PGE 

l*l9 

lE.7|)0n 

17.7i46 

#1073 

X 

PGE 

1*22 

9.200ft 

27,9630 

. 0629 

4 

PGE 

2-7 

ft. 5 POO 

39. 8ft It 

.0444 

9 

PGE 

2-l9 

i.ftooo 

9,5904 

• 0109 

6 

PGE 

2-31 

13.S000 

'^36,5842 

.0943 

7 

PGE 

3-4 

iS.Toon 

65,3405 

.1073 

U 

PGE 

3-7 

1ft. 4000 

58,i7q5 

• il2l 

9 

PGE 

3;l3 

16.2000 

70,ftl77 

.1244 

io 

PGE 

4i7 . 

9*2000 

'^32.9078 

• 0629 

XI 

PGE 

4-i7 

11.2000 

11.5472 

,076ft 

12 

PGE 

4-20 

10.4000 

18,8801 

.0711 


total 

14ft. 3001) 

427,0961 

1*0000 


AVETaGE 

I2.l9l7 

35,6560 

.0633 


Table 6-2. Population #2 data, population size is 12 units 


S "9 


pgroLKErcH #i n ^ 115 n = 15 200 HEPUcancNS 




PPSQR ESTIMAEIE SAHPLIM5 


PPS ESIHMAXE SRMPLINS 



WnHODT REPU£Q1E1IT 



wuh repucem^ 




RN SEED 






1 

2 

3 

1 

2 

3 

y 

10251 

10251 

10251 

10251 

10251 

10251 

E(Y) 

10198 

10290 

10308 

10195 

10325 

10357 


8404 

8262 

8330 

7806 

8187 

8162 

MAX Y 

15169 

14559 

14729 

16065 

15105 

15172 

PSEPllSE 

.7400 

.8150 

,7300 

.7350 

.7150 

.7100 

eR0Pi2SB 

.9700 

.9600 

.9400 

.9500 

.9550 

.9500 

V(Y) 

1,460,354 

1,350,601 

1,765,733 

1,857,933 

1,700,433.4 

1,812,955 

E (TO) ) 

1,407,140 

1,637,944 

1,593,888 

1,597,784 

1,962,819 

1,899,456 

MIN TO) 

91388 

*■ 

103,398 

81123 

101,001 

126,192 

136,022 

MAX V(Y) 

, 10,669,310 

9,867,814 10,837,540 

12,447,447 

16,021,953 

10,872,164 

HIN PSE 

.0353 

.0336 

.0327 

.0367 

.0385 

.0411 

MAX PSE 

.2344 

.2418 

.2343 

.2606 

.2650 

.2481 

CV V(TO)) 

1.5070 

1.2945 

1.3850 

1.6175 

1.3340 

1.3409 

HIN 

J1979CZ 

03/19/76 


FAPCa^l 

03/19/76 



Table G-3. Results of repeated sampling from Population #l with a sample size of 15 using three different samp- 
ling procedures and'three different random number sequences. 





ai-t j. -t. ■’ 


^ . .--I...- ■, 





POPULATION #1 


PPS Sampling Without Replacement 
Random Number Seed 

Random Number Seed 



1 

2 

3 

Y 

10,251 

10,251 

10,251 

ECY)^ 

10,360 

10,469 

10,483 

MIN Y 

8,469 

8,393 

8,463 

MAX Y 

16,190 

15,192 

15,405 

PROP + 1 SE 

.7450 

.7550 

.7250 

PROP + 2 SE 

.9550 

.9550 

.9250 

V(Y) 

1.592,t,il 

1,572,780 

2,025,960 

ECVCY)) 

1,706,046 

2,056,240 

1,969,355 

MINV(Y) 

116,831 

121,059 

103,009 

MAX VCY) 

14,444,207 

12,409,517 

13,625,708 

MIN RSE 

,0401 

.0361 

.0369 

MAX RSE 

.2451 

. 2567 

.2444 


CVCVCY)) 


RUN FAPCAOO 06/23/76 


Table 6-3» continued. 


G-6 


Table G-4. Results of repe; 

size of 4 using 
different randoi 










i i'r > 











POPUIATIGN #2 N = 12 n = 4 


2000 REFLIG^TIONS 



PPS ESTIMATE SAMPLING 
VJHHOUT REPLACEMENT 



RN SEED 



1 

2 

3 

4 

Y 

427.90 

427.90 

427.90 

427.90 

E(Y) 

420.78 

427.76 

428.91 

426.62 


230.27 

230.27 

230.27 

230.27 


715.17 

715.17 

704.08 

663.01 

PBOPtlSE 

.7375 

.7315 

.7495 

.7470 

PP£P:2SE 

.9825 

.9755 

.9810 

.9835 

V(Y) 

0112 

6345 

6100 

5980 

E(V(Y)) 

9195 

9029 

9054 

9323 

MIN ^(Y) 

448 

448 

448 

448 

MflX V(Y) 

34229 

33354 

39209 * 

31518 

MIN PSE 

.0381 

.0382 

.0382 

.0382 

MAX RSE 

.4477 

.4736 

.4736 

.4477 

Ci^(V(Y)) 

.6166 

.6178 

.6274 

.6117 

RUN 






Table G-4, continued. 


APPENDIX H 


Quincy Ranger District Plaimine Model Application Results 


The planning model applications results consist of six tables 
H-1 through H-6 which show sampling system cost for various factors. 


Table H-1, Allocation Results, Cell Entries Represent 
System Cost 


Strategy = STR2SD PSU Size = 60 X 6 








PRECISION 







t = 

= .100 

t = 

1.96 

PROB 

Cost 1 

CORREL 

Seed 

AE = .2 

AE = .1 

AE = .2 

AE = .1 




175 

.8 

1 

3714 

5747 

5652 

12044 


5 



2 

3718 

5760 

5665 

12075 


C- 


11 

q 

1 


5396 

■liM 

11234 





2 


5404 


11261 


G,. 


100 

g 

1 

2558 

4467 

4373 

10428 


5 



2 

2560 

4476 

4380 



10459 


Ca 


11 

.9 

X 




9762 

PPS 

4 



2 

■BS9H 


BEz^uHi 

9790 

s 

= 

0 

.8 

1 

2 

3852 



13646 


^4 

= 

111 

.9 

1 

2 






^5 

= 

778 

.8 

1 

2 

12429 



23190 


^4 

= 

11 

.9 

1 

2 






'=5 


175 

.8 

1 

2 


8956 

8978 

8728 

8759 

23845 

23895 


=4 

= 

11 

.9 

1 

2 

4415 

4421 

8560 

8578 

8351 

8369 

■BM 

BE^9B 


n 


100 

.8 

1 

3430 

7660 

7446 

22259 


B 


2 

3435 

7679 

7465 

22305 




11 

Q 

1 

3304 

7340 ^ 


21594 

EQUAL 

KJ 


« 7 

2 

3309 

7356 

BE^SHI 

21635 


B 



.8 

U 

4722 



25341 


B 

1 


,9 

H 







s 

778 

.8 

1 









0 

13387 



34944 


‘^4 


11 

.9 

1 

2 






M-2 
















































I 




Table H-2. Allocation Results, Cell Entries Represent 
System Cost 

STRATEGY = STR2SD PSU SIZE = 40 X 4 




i^n 

IH 8 

L.OO 

t = 

L.96 

PROS 



E!Sn 

AE = .1 

Ae = .2 

AE = ,1 



.8 






C 4 = 11 

.9 

HH 





C 5 = 100 

.8 




10749 


C 4 = 11 

.9 

■■ 




PPS 

n 

tn 

II 

0 

.8 

3793 



13875 

C 4 . Ill 

.9 

IB 





C 5 = 778 

.8 

12498 



23486 


C^ = 11 

4 

.9 

mil 





C 5 = 175 

.8 

mi 





C 4 = 11 


im 





Cg = 100 

.8 

3205 



19697 


G 4 = 11 

.9 

i^mmBiBi 




■ 

C 3 = 0 

.8 

4500 



22768 

■ 

C 4 == 111 

.9 

■■ 




■ 

C 5 = 778 

.8 

13193 

' 


32385 

■ 

C. = 11 

4 

.9 

imi 
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Table H-3. Alloeation Results, Cell Entries Represent 
System Cost 
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Table H-4. Alloeation Results, Cell Entries Represent 
System Cos,t 


strategy = STRAT2S PSU SIZE = 60 X 6 



I 














































Table H-6, Allocation Results, Cell Entries Represent 
System Cost 


strategy = STR2SD PSU SIZE = 10 x 10 




t = 

IflOESlHii 

t = 

1.96 

PROS 

COST 

AE = .2 

AE - .1 

AE = .2 

AE 


Cj = 200 







Cj = 23 






S = 

1415 



15274 

PPS 

Cj = 23 







4404 



67138 


C 2 = 147 






C 3 = 575 

3027 



22918 


23 






C 3 = 200 

C 2 = 23 






C, - 1Z5 

j 

2440 



24288 

EQUAL 

* 23 






=3= « 

Cj = 147 

5879 



87251 


C 3 = 575 
G2 = 23 

5729 



39129 






APPENDIX I 


Planning Model Exai^le Output 

The output reproduced here represents a t)T>ical detailed output 
siunmary for a single model run. 

The 59 constraints may be evaluated as follows: 


1-8 

max n* 

9-16 

max n* 

17-24 

max n 

25-27 

variance constraint 

28-35 

min n* 

36-43 

min n* 

44-51 

min n 

52-59 

minimum degrees of freedom constraint. 


The current value of x entries represent 1/n*, 1/n', 1/n except for 
entries under SUBROUTINE SUMRIZE which show n*, n', n. The sequence of 
entries is 


(n*j^, h=l, .. .,8) I 

(n’j^, h=l, ..,8) \ 24 total values of "X” 



5t«f riNE 

RE AO IN 

— STRATiriEO 7Wt1 

stage WtTH 

Double sa^ e AiLnCATtor 





NILS * 17 

npaR « 

3 

X B 

<0 7 B 4 






RbACI8 PPINTON 

T = 1 

.9600 RHO 

= .8000 





3U. /. 

l5. 11. 778. 



rwpR r»i*NsLS 



1 
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0/ 

o.no 

O.no 

0,00/ 

r . 0. 

0 . 0 , 
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0, 

0. 

n. 

P, 

>’t 

0. 

0. 

5 

The 

0/ 

0.00 

n.no 

o.cp/ 

0, 0. 

0* 

n. 

0, 

0. 

@ 

& 

• TI:H6P 

53/ 

12223.55 

8578,22 

9o9.49/ 

0, 0. 
.34265s*08 .29210E+08 

^*25725e*07 

n. 

• i7i75[:*o8 

.1823*6*07 

c. 

.i943«JE*0? 



,96837(i*o8 •63705E*08 

,65285e*o7 

.523116*08 

.50 6346+0 7 
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0/ 

0.80 

n.oo 

0 . 00/ 

0. 0. 

P, 

0. 

G. 

r. 


Five 

9/ 
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P t 
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n. 
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Gi 
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0.0 0 

O.QiO./ 
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n . 

n. 

0. 

0. 
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2830.34 

l«4i ,53 

102.71/ 

0. 3. 
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j49540E*ij6 

0 . 
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.139926*07 
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0/ 

0.00 
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n. 
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0. 0. 

n. 
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P. 

Ej . 



0/ 
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0 . 00/ 
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n. 

n. 

0. 

0. 
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3. 
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c. 
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c. 

0 . 



0/ 
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n. 
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Appendix J 

Implementation Plaa for the opti^l Quincy Rirnggr 
District sampline system 


1.0 Introduction 


The plan presented here for the Quincy Ranger District is intended 
only as a brief summary of a plan which would be In considerably more 
detail were the system actually to be implemented. The intent of this 
summary version is to identify the major topics which must be addressed 
prior to conducting the sample survey itself. 


2.0 Estimation objectives 

2.1 Population specifications -- The sampling population is defined as 
all the land owned and administered by the U.S. Forest Service within 
the Quincy Ranger District of the Plumas National Forest. 

2.2 parameter specifications -- Three major parameters are to be estimated 
by the sampling system: 1^ number of trees, 5.0" DBH and larger coniferous 
species; 2) total basal area in square feet inside bark, trees 5.0" DBH 
and larger, coniferous species; and 3) total basal area growth, annual, in 
square feet inside bark, trees 5.0" DBH and larger, coniferous species. 
These are the parameters upon which design is based. Other parameters 
could be incorporated, if desired, but without any opportunity to control 
precision obtained. 

2.3 Desired format for summarizing estimates -- A tabular output is 
required showing the estimate, the s^^Ie estimates of variance, the 
standard error and the relative standard error in per cent. 


3.0 Siamnary of the planning phase 

3.1 Data utilized — Spatially referenced land usc/vegetation class data 
for grid cells of roughly 1.1 acre were obtained from analysis of LANDSAT 
multispeetral data. Supplementary data from large scale photographs and 
field samples were used to estimate the parameter set and its associated 
covariance matrix for each vegetation class. Ownership data was used to 
eliminate all non Forest Service la^ds. LANDSAT data was acquired in 1972, 
sample data in 1974, and ownership data is from 1971. 
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3.2 altematiyeSj cggigidefed and selection of optiaun Systein — The 
alternatives considered have been described previously' in section 3.3.3, 
Application Results » of the main report. That material and the development 
of the best system would normally be included here. 

Particular sources of auxiliary data are summarized here since they 
effect the implementation, especially measurement techniques. The 
stratification and selection probabilities use the cell-by-cell vegetation 
class assignments (see section 3. 2. 2. 2). Correlation coefficient for the 
double sampling strategy is a reflection of the strength of the assumed 
linear regression relationship between a photo variable, x, and the 
variable of interest, y. In this application y is actually a vector 
consisting of three variables (number of trees, basal area, and basal area 
growth) the x vector would be the photo variable which best approximates 
these variables, or a proxy variable which is strongly related to the y 
variable. The correlation would also reflect the impact of other factors 
in the measurement processes including photo variables (scale, date, 
image quality, etc.) and interpretation variables such as the experience 
level of interpreters. In this application the same correlation is assumed 
for all variables. 


4.0 Sampling System Description 

4.1 Sampling model — The population to be sampled is specified in 2.1, 
Sampling units are to be formed by partitioning the area into clusters of 
60 X 6 pixels. The sampling frame is the set of sampling units defined 
by the partitioning of the population. 


4.2 Parameters — The three parameters are defined in 2.2. 

4.3 Auxiliary data -- The land use/vegetation classification results are 
to be used for stratification purposes. Interpretation of large scale 
photographs provides supplementary estimates of basal area and number of 
trees. These variables have shown strong correlation with number of trees, 
basal area, and basal area growth as measured directly at field plots. 


4.4 Sampling technique and estimators — Stratification will be accomplished 
by assigning each sampling unit to a land use/vegetation class based on the 
class with a plurabity of "good" pixels within each unit. Specifically 
there are to be 8 strata as indicated in Table 12 of the text. The 
sampling system is to be two stage with double sampling within each stratum. 
At stage I n|^ sample units will be selected in each stratum. Selection 
probabilities for stage I units are proportional to size (see text). Within 
each unit n^ second stage sample units will be selected. They will 
consist of 6.4 acre circular plots located by systematic coverage of large 
scale photographs at a scale of 1:1,000. 
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Measurements of auxiliary variables (4.3) will be made for each selected 
plot. From the set of all photo plots n. will be selected from each 
stratum for direct measurement of the parameter of interest. 


4.5 Sample size and allocation — Sample sizes based on prior analysis 
(See text table 12) is as follows: 


Stratum (h) 

m* 

"h 

”h 

"h**h 

n*: 

1 

5 

6 

1 

30 

5 

2 

1 

9 

3 

9 

3 

3 

3 

5 

2 

15 

6 

4 

36 

11 

1 

396 

36 

5 

1 

9 

9 

9 

9 

6 

1 

10 

10 

10 

10 

7 

11 

8 

1 

88 

11 

8 

23 

10 

1 

230 

23 


Total 81 787 103 

S.G Iraplementation procedures and gpecifieatlons 

This section would deal with the '^how-to-do-it" aspects of the survey 
and three major topics would be addressed; 1) sample selection procedure, 
2) measurement proceudre, and 3) data recording procedure. 


6.0 Analysis procedures 

This section would deal with the specification of method and procedure 
necessary to analyse the data, construct estimates, and summarize the results. 


